WO9840477 fhttp://www.getthepatentxom/Loqin.d<^ ^nk99/ Fetch/WO9840477 



Page 1 of 108 



PCX 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 

International Bureau 




INTERNATIONAL APPUCATION PUBUSHED UNDER THE PATENT COOPERATION TREATY (PCI) 



(51) International Patent Classification ^ : 

C12N 15/00, 15/09, C07H 21/02, 21A)4 



Al 



(11) International Publication Number: WO 98/40477 

(43) International Publication Date: 17 September 1996 (17.09.98) 



(21) International Application Number: 

(22) International Filing Date: 



PCrAJS98/04978 
13 March 1998 (13.03.98) 



(30) Priority Data: 
08/818.252 
08/818.253 
08/919.143 



14 March 1997 (14.03.97) 
14 March 1997 (14.03.97) 
27 August 1997 (27.08.97) 



US 
US 

us 



(71) Applicant: THE REGENTS OF THE UNIVERSITY OF 

CALIFORNIA [US/US]; 22nd floor, 300 Lakeside Drive, 
Oakland. CA 94612-3550 (US). 

(72) Inventors: TSIEN, Roger, Y.; 8535 Nottingham Place, La 

Jolla, CA 92037 (US). MIYAWAKI, Atsushl; 3899 Nobel 
Drive, San Diego, CA 92122 (US). 

(74) Agent: HAILE, Usa, A.; Fish & Richardscm P.C., Suite 1400, 
4225 Executive Square, U Jolla, CA 92037 (US). 



(81) Designated States: AL, AM, AT. AU, AZ, BA, BB. BG. BR, 
BY, CA, CH. CN. CU, CZ, DE. DK. EE. ES. FI. GB, GE, 
GH, HU, IL. IS. JP. KB, KG. KP. KR, KZ, LC. LK, LR, 
LS, LT. LU. LV. MD. MG, MK, MN. MW, MX. NO. NZ, 
PL, FT. RO. RU. SD. SE, SG. SI, SK. SL, TJ, TM, TR. 
TT, UA, UG, UZ, VN, YU. ZW. ARIPO patent (GH, GM, 
KE. LS. MW. SD, SZ. UG, ZW). Eurasian patent (AM, AZ, 
BY. KG, KZ, MD, RU, TJ, TM), European patent (AT, BE. 
CH, DE, DK. ES. FI, FR, GB. GR, IE. FT, LU. MC, NL. 
PT, SE), OAPI patent (BF. BJ. CF, CG, CI,.CM, GA, GN, 
ML, MR, NE, SN, TD, TG). 



Published 

With international search report. 

Bejore the expiration of the time iimit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(54) Title: FLUORESCENT PROIEIN SENSORS FOR DETECTION OF ANALYTBS 
(57) Abstract 

Fluorescent Indicators including a binding protein moiety, a donor fluorescent protein moiety, and an acceptor fluorescent proteto 
moiety are described. The bbidlng protein moiety has an analyt&^binding region which binds an analyte and causes die indicator to change 
conformation upon exposure to the analyte. The donor moiety and the acceptor moiety change posidon reladve to each odier when the 
analyte binds to the analyte-binding region. The donor moiety and the acceptor moiety exhibit fluorescence resonance energy transfer when 
the donor moiety is excited and the distance between the donor moiety and the acceptor moiety is small. The indicators can be used to 
measure analyte concentrations is samples, such as calcium ion concentrations in cells. 




W0984(M77 fhttp://WAW.getthepatent.corn/Login r1n^^ii|i'l')/l eh h/WO9840477xpc?froml n In l|i ill ii hull | 



Page 2 of 108 



FOR THE PUltPOSBS OP INFORMATION ONLY 
Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albanta 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Annenla 


FI 


Finland 


LT 


LItaoia 


SK 


Slovalcia 


AT 


Ausula 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabcm 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azeibaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Henegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajildstan 


BE 


Belgmro 


GN 


Guinea 


MK 


The fonner Yugoslav 


TM 


'Hirlonenistan 


BP 


Buridna Paso 


GR 


Greece 




Republic of Macedonia 


TR 


Today 


BG 


Bu^arta 


HU 


Hungary 


ML 


MaU 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mcogolia 


UA 


Ukraine 


BR 


BrazS 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


UZ 


Uzbekistan 


CP 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyigyzstan 


NO 


Norway 


ZW 


Zfanbabwe 


CI 


CAte d'lvoire 


KP 


Democratic People** 


NZ 


NewZeahuxl 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Kofea 


PT 


Pntugal 






CU 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






CZ 


Czech Republic 


LC 


Saint Luda 


RU 


Russian Federadon 






DE 


Germany 


U 


Liechtenstein 


SD 


Sudan 






DK 


DennsBik 


LK 


Sri Lanka 


SB 


Sweden 






EE 


Estonia 


LR 


Uberia 


SG 


SfaigapMB 







t 



i. 



WO9840477 fhttp://www.getthepat rnr rnm/l nqin rinr^^il 'l'l/l cli li/ WQ9840477xpc?fromCache==1par1 =mair^||^nhnttn 



WO 98/40477 PCT/US98/04978 



- 1 - 



FLUORESCENT PROTEIN SENSORS FOR DETECTIQN OF ANALYTES 

Statement as to Federally Sponsored Research 
5 This mvention was made with Government support under Grant No. NS27 1 77, awarded 
by the National Institutes of Health. The Government has certain rights in this invention. 

Background of the Invention 
The invention relates to fluorescent protein sensors for detecting and quantifying 
analytes. 

10 Measurement of an analyte concentration in vitro or in vivo by non-invasive techniques 
can help elucidate the physiological function of the analyte. This can also aid in 
identifying changes that occur in a cell or organism in response to physiological stimuli. 
For example, cyclic AMP can be detected by fluorescence resonance energy transfer 
between a separately labeled proteins that associate with each other but are not 

15 covalently attached to each other. See, U.S. Pat No. 5,439,797. 

For example, many effects of Ca^^ in cells are mediated by Cs?^ binding to calmodulin 
(CaM), which causes CaM to bind and activate target proteins or peptide sequences. 
Based on the NMR solution structure of CaM bound to the 26-residue Ml 3 Ca^^-binding 
peptide of myosin light-chain kinase, Porumb et al fused the C-terminus of CaM via a 
20 Gly-Gly spacer to the M13. Ca^"^ binding switches the resulting hybrid protein 
(CaM-M13) from a dimibbell-like extended form to a compact globular form similar to 

■ 

the CaM-M13 intermolecular complex. See, Porumb, T., et aL, Frot.Engineering 
7:109-115(1994). 

Fluorescent Ca^^ indicators such as fura-2, indo-1 , fluo-3, and Calcium-Green have been 
25 the mainstay of mtracellular Ca^^ measurement and imaging. See, for example, U.S. Pat. 
No. 4,603,209 and U.S. Pat. No. 5,049,673. These relatively low molecular weight 
indicators can suffer from many technical problems relating to ester loading, leakage of 
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the dyes fix>m the cell, compaitmentation in organelles, and perturbation of the indicators 
by cellular constituents. Although the Ca^^-indicating photoprotein aequorin is 
targetable, the photoresponse to Ca^^ is low since it is chemiluminescent Moreover, 
aequorins need to incorporate exogenous coelenterazine. 



This invention provides fluorescent indicators and methods for using them to determine 
the concentration of an analyte both in vitro said in vivo. In one aspect, the fluorescent 
indicator includes a binding protein moiety, a donor fluorescent protein moiety, and an 
acceptor fluorescent protein moiety. The binding protein moiety has an analyte-binding 

10 region which binds an analyte and caiises the indicator to change confoimation upon 
exposure to the analyte. The donor fluorescent protein moiety is covalently coupled to 
the binding protein moiety. The acceptor fluorescent protein moiety is covalently 
coupled to the binding protem moiety. In the fluorescent indicator, the donor moiety and 
the acceptor moiety change position relative to each other when the analyte binds to the 

15 analyte-binding region, altering fluorescence resonance energy transfer between the 
donor moiety and the acceptor moiety when the donor moiety is excited. 

The donor fluorescent protein moiety and the acceptor fluorescent protein moiety can be 
Aequorea-rdatcd fluorescent protein moieties. Preferably, the donor fluorescent protein 
moiety is P4-3, EBFP, or WIB, and the acceptor fluorescent protein moiety is S65T, 
20 EGFP,orlOc. 

In preferred embodiments, the indicator further includes the target peptide moiety and 
a linker moiety that covalently couples the binding protein and the target peptide moiety. 
The binding protein moiety further includes a peptide-binding region for binding the 
target peptide moiety. The binding protein moiety can be covalently coupled to the 
25 donor fluorescent protein moiety and the target peptide moiety can be covalently coupled 
to the acceptor fluorescent protein moiety. 



5 



Summary of the Invention 
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The indicator can be a single polypeptide. In preferred embodiments, one of the donor 
fluorescent protein moiety or the acceptor fluorescent protein moiety is covalently 
coupled to the carboxy terminus of the single polypeptide and the other of the donor 
fluorescent protein moiety or the acceptor fluorescent protein moiety is covalently 
5 coupled to the amino terminus of the single polypeptide. 

The indicator can include a localization sequence. The localization sequence can be a 
nuclear localization sequence, an endoplasmic reticulum localization sequence, a 
peroxisome localization sequence, a mitochondrial import sequence, a mitochondrial 
localization sequence, or a localized protein. 

10 In preferred embodiments, the linker moiety is a peptide moiety. The linker moiety can 
include between about 1 amino acid residue and about 20 amino acid residues. The 
linker moiety can be -Gly-Gly-. 

Preferably, the binding protein moiety is cahnodulin, a cahnodulin-related protem 
moiety, cGMP-dependent protein kinase, a steroid hormone receptor, a ligand binding 
15 domain of a steroid hormone receptor, protein kinase C, inositol- l,4,S-triphosphate 
r&cq)tor, or recoverin. A calmodulin-related protein moiety is derived fix)m calmodulin 
that has been modified to have a different binding affmity for calcium or a target peptide 
moiety. 

Most preferably, the binding protein moiety is calmodulin or a calmodulin-related protein 
20 moiety. In these embodiments, the target peptide moiety can be a subsequence of a 
calmodulin*binding domain of Mli3, smMLCKp, CaMKII, Caldesmon, Calspennin, 
Calcineurin, PhK5, PhK13, C28W, 59.kDa PDE, 60-kDa PDE, NO-30, AC-28, 
Bordetella pertussis AC, Neuromodulin, Spectrin, MARCKS, F52, B-Adducin, HSP90a, 
HIV-1 gpl60, BBMHBI, Dilute MHC, Mastoparan, Melittin, Glucagon, Secretin, VIP, 
25 GIP, or Model Peptide CBP2. Preferably, the target peptide moiety is Ml 3. 
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In another aspect, the invention features a fluorescent indicator including a target peptide 
moiety, a binding protein moiety, a linker moiety, a donor fluorescent protein moiety 
cpvalently coupled to the binding protein moiety, and an acceptor fluorescent protein 
moiety covalently coupled to the binding protein moiety. The binding protein moiety has 
5 an analyte-binding region which binds an analyte and causes the indicator to change 
conformation upon exposure to the analyte. The linker moiety covalentiy couples the 
binding protein and the target peptide moiety and is a peptide moiety. The binding 
protein moiety has a peptide-binding region for binding the target peptide moiety. The 
indicator is a single polypeptide. 

10 In another aspect, the invention features a method for determining the concentration of 
an analyte in a sample. The method includes the steps of contacting the sample with a 
fluorescent indicator having a donor fluorescent protein moiety, binding protein moiety, 
and acceptor protein moiety, exciting the donor moiety, and detennining the degree of 
fluorescence resonance energy transfer in the sample corresponding to the concentration 

15 of the analyte in the sample. 

In preferred embodiments, the step of determining the degree of fluorescence resonance 
energy transfer in the sample includes measuring tight emitted by the acceptor 
fluorescent protein moiety. In other preferred embodiments, determinmg the degree of 
fluorescence resonance energy transfer in the sample includes measuring light emitted 

20 from the donor fluorescent protein moiety, measuring light emitted from the acceptor 
fluorescent protein moiety, and calculating a ratio of the light emitted from the donor 
fluorescent protein moiety and the light emitted from the acceptor fluorescent protein 
moiety. In yet other preferred embodiments, determining the degree of fluorescence 
resonance energy transfer in the sample includes measuring the excited state lifetime of 

25 the donor moiety. 



The method can further include the steps of determining the concentration of the analyte 
at a first time after contacting the sample with the fluorescence indicator, determining the 
concentration of the analyte at a second time after contacting the sample with the 
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fluorescence indicator, and calculating the difference in the concentration of the analyte 
at the first time and the second time, whereby the difference in the concentration of the 
analyte in the sample reflects a change in concentration of the analyte present in the 
sample. 

5 In other embodiments, the method can further include the step of contacting the sample 
with a compound between the first time and the second time, whereby a difference in the 
concentration of the analyte in the sample between the first time and the second time 
indicates that the compound alters the presence of the analyte. 

In preferred embodiments, the sample includes an intact cell and the contactmg step 
10 includes incorporating the fluorescent indicator into the cell. The step of incorporating 
the fluorescent indicator into the cell can include transfecting the cell with an expression 
vector comprising expression control sequences operably linked to a nucleic acid 
sequence coding for the expression of the fluorescent indicator. The preferred analyte 
is calciimi. 

15 In yet another aspect, the invention features an isolated nucleic acid sequence which 
encodes the fluorescent indicator. In preferred embodiments, an expression vector or a 
transgenic non-human animal includes the nucleic acid sequence. 

In another aspect, the invention features an expression vector including expression 
control sequences operatively linked to a nucleic acid sequence coding for the expression 
20 of the fluorescent indicator. The expression vector can be adapted for function in a 
prokaiyotic cell or a eukaryotic cell. 

In another aspect of the invention, a host cell transfected with an expression vector can 
include an expression control sequence operatively linked to a sequence coding for the 
expression of the fluorescent indicator. The cell can be a prokaryote, such as K coli, or 
25 a eukaryotic cell, such as a yeast cell or mammalian cell. 
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In another aspect, the invention features a transgenic non-human animal having a 
phenotype characterized by expression of the nucleic acid sequence coding for the 
expression of the fluorescent indicator. The phenotype is conferred by a transgene 
contained in the somatic and germ cells of the animal. The animal can be a mouse. 

5 In another aspect, the invention features a method for producing a transgenic non-human 
animal having a phenotype characterized by expression of the nucleic acid sequence 
coding for the expression of the fluorescent indicator. The method includes the steps of: 
(a) introducing a transgene into a zygote of an animal, the transgene comprising a DNA 
construct encoding the fluorescent indicator; (b) transplanting the zygote into a 

10 pseudopregnant animal; (c) allowing the zygote to develop to term; and (d) identifying 
at least one transgenic offspring containing the transgene. The step of introducing of the 
transgene into the embryo can be by introducing an embryonic stem cell containing the 
transgene into the embryo, or infecting the embryo with a retrovirus containing the 
transgene. 

15 "Peptide" refers to a polymer in which the monomers are amino acid residues which are 
joined together through amide bonds, alternatively referred to as a polypeptide. A 
"single polypeptide" is a continuous peptide that constitutes the protem. When the amino 
acids are alpha-amino acids, either the L-optical isomer or the D-optical isomer can be 
used, the L-isomers being preferred. Additionally, unnatural amino acids such as beta- 

20 alanine, phenylglycine, and homoarginine are meant to be included. Commonly 
encountered amino acids which are not gene-encoded can also be used in the present 
invention, although preferred amino acids are those that are encodable. For a general 
review, see, for example, Spatola, A.F., in Chemistry and Biochemistry of Amino Acids, 
Peptides and Proteins, B. Weinstein, ed., Marcel Dekker, New York, p. 267 (1983), 

25 "Fluorescent protein" refers to any protein capable of emitting light when excited with 
appropriate electromagnetic radiation. Fluorescent proteins include proteins having 
amino acid sequences that are either natural or engineered, such as the fluorescent 
proteins derived from Aequorea-relsiXGd fluorescent proteins. 
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In FRET, the "donor fluorescent protein moiety" and the "acceptor fluorescent protein 
moiety" are selected so that the donor and acceptor moieties exhibit fluorescence 
resonance energy transfer when the donor moiety is excited. One factor to be considered 
in choosing the donor/acceptor fluorescent protein moiety pair is the efficiency of FRET 
5 between the two moieties. Preferably, the efficiency of FRET between the donor and 
acceptor moieties is at least 10%, more preferably at least 50%, and most preferably at 
least 80%. The efficiency of FRET can be tested empirically using the methods 
described herein and known in the art, particularly, using the conditions set forth in the 
Examples. 

10 "Covalently coupled" refers to a covalent bond or other covalent linkage between two 
moieties. The covalent linkage can include adimdical moiety linking to two moieties. 

"Binding protein" refers to a protein capable of binding an analyte. Preferred binding 
proteins change conformation upon binding the analyte. "Target peptide" refers to a 

• • • 

peptide that can bind to the binding protein. The target peptide can be a subsequence of 
1 5 a peptide that binds to the bmding protein. 

"Analyte" refers to a molecule or ion in solution that binds to the binding protein, causing 
it to change conformation. Preferably, the analyte binds reversibly to the binding protein. 

"Moiety" refers to a radical of a molecule that is attached to another radical of the 
indicator. Thus, a "fluorescent protein moiety" is the radical of a fluorescent protein 
20 coupled to a bindmg protein moiety or a linker moiety, a "binding protein moiety" is a 
radical of a binding protein coupled to a fluorescent protein moiety, a "target peptide 
moiety" is a radical of a target peptide of the binding protein, and a "linker moiety" refers 
to the radical of a molecular linker that is ultimately coupled to both the donor and 
acceptor fluorescent protein moieties. 

25 "Operatively linked" refers to a juxtaposition wherein the components so described are 
in a relationship permitting them to function in their intended manner. A control 
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sequence operatively linked to a coding sequence is ligated such that expression of the 
coding sequence is achieved under conditions compatible with the control sequences. 
"Control sequence" refers to polynucleotide sequences which are necessary to effect the 
expression of coding and non-coding sequences to which they are ligated. Control 
5 sequences generally include promoter, ribosomal binding site, and transcription 
termination sequence. The term "control sequences" is intended to include, at a 
minimum, components whose presence can influence expression, and can also include 
additional components whose presence is advantageous, for example, leader sequences 
and fusion partner sequences. 

10 "Polynucleotide" refers to a polymeric form of nucleotides of at least 10 bases in length. 
The nucleotides can be ribonucleotides, deoxynucleotides, or modified forms of either 
type of nucleotide. The term includes single and double stranded forms of DNA. - 

The invention can have one or more of the following advantages. Ligand-induced 
conformational changes can be monitored by FRET if, for example, the amino and 

15 carboxy termini of the binding protein are ftised to a donor and acceptor GFP. This 
approach has several advantages over the usual covalent labeling with fluorescent probes. 
The fluorescent indicator can be generated in situ by gene transfer into the cells or 
organisms. This approach avoids the need to express and purify large quantities of 
soluble recombinant protein, purify and label it in vitro^ niicroinject it back into cells. 

20 The fluorescent indicator can be targeted to cellular structures. The sites of fusion 
between the moieties of the fluorescent indicator are exactly defined, giving a 
molecularly homogenous product without relying on elaborate protein chemistry. In 
addition, the chromophore of GFP is fixed in the protein. See, Ormo,M.,e/ a/., Sc/ewce 
273:1392-1395 (1996). If the GFP donor and acceptor are fused to a host protein rigidly, 

25 minor changes in the relative orientation of the ends of the latter would alter FRET. In 
contrast, most conventional fluorescent labels are attached by flexible Unkers that at least 
partially decouple the fluorophore orientation from 
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that of the protein to which it is attached, limiting the sensitivity of the FRET 
measurement. 

Other features or advantages of the present invention will be apparent from the following 
detailed description of the invention, and also from the claims. 

5 Brief Description of the Drawings 

FIG. 1 is a schematic diagram depicting a fluorescent indicator that measures the 
concentration of an analyte by fluorescence resonance energy transfer. 

FIG. 2a is a schematic diagram depicting the structure of chimera proteins containing 
P4-3, CaM-M13, and S65T, where HIS is the amino-terminal tag peptide containing the 
10 polyhistidine sequence and XCaM is Xenopus cahnodulin. 

FIG. 2b is a diagram depicting the amino acid and nucleotide sequences of the 
boimdaries between P4-3 and CaM and between M13 and S6ST in cameleon-1 . 



FIG. 3 is a graph depicting the emission spectra of cameleon-1 before and after addition 
of 2 mM CaClj to give 1 mM free Cs?\ 

15 FIG. 4a is a graph depicting the change in emission spectrum of cameleon-1 on titration 
with Ca^* when excited at 3 80 nm. 

FIG. 4b is a graph depicting Ca^ titration curves of cameleon-1 (open circles) and 
cameleon-1 /El 04Q (solid circles). 

FIG. 5 is a schematic diagram depicting the structures of cameleon-2 and cameleon-3, 
20 and their derivatives, cameleon-2nu and cameleon-3ery where Kz is Kozak's consensus 
sequence. 



wo 98/40477 PCTAJS98/04978 

-10- 

FIG. 6a is a digital fluorescence image depicting fluorescence of cameleon-2 localized 
to the cytosol of HeLa cells, where the bar is 10 Tm. 

FIG. 6b is a graph depicting temporal changes in the emission ratio of cameleon-2 for 
each of the HeLa cells shown in FIG. 6a. 

S FIG. 6c is a digital fluorescence image depicting fluorescence of cameIeon-2nu localized 
to the nuclei of HeLa cells, where the bar is 10 Tm. 

FIG. 6d is a graph depicting temporal changes in the emission ratio of cameleon-2nu in 
the two nuclei shown in FIG. 6c. 

FIG. 6e is a digital fluorescence image depicting fluorescence of cameleon-3er in 
1 0 transfected HeLa cells, where the bar is 1 0 Tm. 

FIG. 6f is a graph depicting the time-course of emission ratio of cameleon-3er (average 
of four cells). 

FIG. 7 depicts the nucleotide sequence (SEQ ID NO: 1) and amino acid sequence (SEQ 
ID NO:2) of cameleon-2. 

15 FIG. 8 depicts the nucleotide sequence (SEQ ID N0:3) and amino acid sequence (SEQ 
ID N0:4) of cameleon-2nu. 

FIG. 9 depicts the nucleotide sequence (SEQ ID NO: 5) and amino acid sequence (SEQ 
ID NO:6) of cameleon-3. 

FIG. 10 is a Ust depicting the nucleotide sequence (SEQ ID N0:7) and amino acid 
20 sequence of cameleon-3er (SEQ ID N0:8). 
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Detailed Description 

A fluorescent indicator that utilizes fluorescent resonance energy transfer ("FRET") to 
measure the concentration of an analyte includes two fluorescent protein moieties having 
emission and excitation spectra that render one a donor fluorescent protein moiety and 

5 the other an acceptor fluorescent protein moiety. The fluorescent protein moieties are 
chosen such that the excitation spectrum of one of the moieties (the acceptor fluorescent 
protein moiety) overlaps with the emission spectrum of the excited protein moiety (the 
donor fluorescent protein moiety). The donor and acceptor fluorescent protein moieties 
are bound to a binding protein moiety that changes conformation upon binding the 

10 analyte. The change in conformation leads to a change in relative position and 
orientation of the donor and acceptor fluorescent protem moieties, thereby altering the 
relative amounts of fluorescence from the two fluorescent protein moieties when the 
donor is excited by irradiation. In particular, binding of the analyte changes the ratio of 
the amount of light emitted by the donor and acceptor fluorescent protein moieties. The 

15 ratio between the two emission wavelengths provides a measure of the concentration of 
the analyte in the sample, which is based in part on the binding afifmity of the binding 
protein moiety and the analyte. 

Referring to FIG. 1 , the donor fluorescent protein moiety is covalently linked to a first 
region (e.g., the amino terminus) of the binding protem moiety, and the acceptor 

20 fluorescent protein moiety is covalently linked to a second region (e.g., the carboxy 
terminus) of the binding protein moiety such that the donor and acceptor moieties move 
closer together upon binding the analyte. Alternatively, the donor and acceptor moieties 
can move farther apart upon binding the analyte. In one embodiment, depicted in FIG. 
1 , the acceptor moiety is covalently bonded to a target peptide moiety that also binds to 

25 the binding protein moiety and the target peptide moiety is covalently bonded to the 
binding protein moiety by a linker moiety. The linker moiety is flexible enough to allow 
the target peptide moiety to bind to the binding protein moiety. The donor moiety is 
excited by light of appropriate intensity within the excitation spectrum of the donor 
moiety (Sc^citation)' The donor moiety emits .the absorbed energy as fluorescent light 

^0 (^emission i)- When the acceptor fluorescent protein moiety is positioned to quench the 
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donor moiety in the excited state, the fluorescence energy is transferred to the acceptor 
moiety which can emit fluorescent light (li^i^n 2)- FRET can be manifested as a 
reduction in the intensity of the fluorescent signal from the donor moiety (SemisiDn 
reduction in the lifetime of the excited state of the donor moiety, or emission of 
5 fluorescent light at the longer wavelengths (lower energies) characteristic of the acceptor 
moiety (Semission 2)- When the conformation of the binding protein moiety changes upon 
binding the analyte, the fluorescent protem moieties come closer together (or physically 
separate), and FRET is increased (or decreased) accordingly. 

The efficiency of FRET depends on the separation distance and the orientation of the 
10 donor and acceptor fluorescent protein moieties. For example, the Forster equation 
describes the efiSciency of excited state energy transfer, based in part on the fluorescence 
quantum yield of the donor moiety and the energetic overlap with the acceptor moiety. 
The Forster equation is: 

E = (Fo-F)/Fo -RoV(R^ + Ro') 

IS where E is the efficiency of FRET, F and Fq are the fluorescence intensities of the donor 
moiety in the presence and absence of the acceptor, respectively, and R is the distance 
between the donor moiety and the acceptor moiety. 

■ 

The characteristic distance Ro at which FRET is 50% efficient depends on the quantum 
yield of the donor moiety (/.e., the shorter-wavelength fluorophore), the extinction 
20 coefficient of the acceptor moiety (i.e., the longer-wavelength fluorophore), and the 
overlap between the emission spectrum of the donor moiety and the excitation spectrum 
of the acceptor moiety* Ro is given (in D) by 
Ro-9.79xlO'(K'QJn^y^ 

where is an orientation factor having an average value close to 0.67 for freely mobile 
25 donors and acceptors, Q is the quantum yield of the unquenched donor moiety, n is the 
refiactive index of the mediimi separating the donor moiety and the acceptor moiety, and 
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J is the overlap integral. J can be quantitatively expressed as the degree of spectral 
overlap between the donor moiety and the acceptor moiety according to the equation: 



J = I'oM2Fj:EM2yiVEdS 

where (M'cm*') is the molar absorptivity of the acceptor and Fj is the donor moiety 
5 fluorescence intensity at wavelength S. See, for example, Forster, T. Am.Physik 2 : 5 5-75 
(1948). Tables of spectral overlap integrals are readily available to those working in the 
field (for example, Berlman, LB. Energy transfer parameters of aromatic compounds^ 
Academic Press, New York and London (1973)). FRET is a nondestructive 
spectroscopic method that can monitor proximity and relative angular orientation of 
1 0 fluorophores in living cells. See, for example, Adams, S.R., et al , Nature 349:694-697 
(1991), and Gonzalez, J. & Tsien, R.Y. BiophyJ. 69:1272-1280 (1995). 

These factors need to be balanced to optimize the efficiency and detectability of FRET 
from the fluorescent indicator. The emission spectrum of the donor fluorescent protein 
moiety should overlap as much as possible with the excitation spectrum of the acceptor 

15 fluorescent protein moiety to maximize the overlap integral J. Also, the quantum yield 
of the donor fluorescent protein moiety and the extinction coefficient of the acceptor 
fluorescent protein moiety should be as large as possible to maximize Rq. In addition, 
the excitation spectra of the donor and acceptor moieties should overlap as little as 
possible so that a wavelength region can be found at which the donor moiety can be 

20 excited selectively and efficiently without directly exciting the acceptor moiety. Direct 
excitation of the acceptor moiety should be avoided since it can be difficult to distinguish 
direct emission firom fluorescence arising firom FRET. Similarly, the emission spectra 

♦ 

of the donor and acceptor moieties should have minimal overlap so that the two 
emissions can be distinguished. High fluorescence quantum yield of the acceptor moiety 
25 is desirable if the emission firom the acceptor moiety is to be monitored to determine 
analyte concentration in a sample. In a preferred embodiment, the donor fluorescent 
protein moiety is excited by ultraviolet (<400 nm) and emits blue light (<500 nm), and 
the acceptor fluorescent protein moiety is efficiently excited by blue but not by 
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ultraviolet light and emits green light (>500 nm), for example, P4-3 and S65T, 
respectively. 



In another preferred embodiment, the donor fluorescent moiety is excited by violet (400 



5 efficiently excited by blue-green (450-500 nm) and emits yellow-green light ($520 nm), 
for example WIB and 1 OC respectively. 

The amoimt of analyte in a sample can be determined by determining the degree of FRET 
in the sample. Changes in analyte concentration can be determined by monitoring FRET 
at a first and second time after contact between the sample and the fluorescent indicator 
10 and determining the difference in the degree of FRET. The amount of analyte in the 
sample can be calculated by using a calibration curve established by titration. 



The degree of FRET can be determined by any spectral or fluorescence lifetime 



characteristic of the excited donor moiety. For example, intensity of the fluorescent 



signal from the donor, the intensity of fluorescent signal from the acceptor, the ratio of 
IS the fluorescence amplitudes near the acceptor's emission maxima to &e fluorescence 
amplitudes near the donor's emission maximimi, or the excited state lifetime of the donor 
can be monitored. 



Preferably, changes in the degree of FRET are determined as a function of the change in 
the ratio of the amoimt of fluorescence Scorn the donor and acceptor moieties, a process 
20 referred to as "ratioing.** Changes in the absolute amount of indicator, excitation 



intensity, and turbidity or other background absorbances in the sample at the excitation 
wavelength affect the intensities of fluorescence fix)m both the donor and acceptor 
approximately in parallel. Therefore the ratio of the two emission intensities is a more 
robust and preferred measure of cleavage than either intensity alone. 

25 Fluorescence in a sample is measured using a fluorometer. In general, excitation 
radiation, from an excitation source having a first wavelength, passes through excitation 



430 rmi) and emits blue-green (450-500 nm) and the acceptor fluorescent moiety is 
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optics. The excitation optics cause the excitation radiation to excite the sample. In 
response, fluorescent proteins in the sample emit radiation wiiich has a wavelength that 
is different from the excitation wavelength. Collection optics then collect the emission 
fiom the sample. The device can include a temperature controller to maintain the sample 

5 at a specific temperature while it is being scanned. According to one embodiment, a 
multi-axis translation stage moves a microtiter plate holding a plurality of samples in 
order to position different wells to be exposed. The multi-axis translation stage, 
temperature controller, auto-focusing feature, and electronics associated with imaging 
and data collection can be managed by an appropriately programmed digital computer. 

10 The computer also can transform the data collected during the assay into another format 
for presentation. 

Methods of performing assays on fluorescent materials are well knoAvn in the art and are 
described in, e.g., Lakowicz, J.R., Principles of Fluorescence Spectroscopyy New 
YorkiPlenum Press (1983); Herman, B., Resonance energy transfer microscopy, in: 
15 Fluorescence Microscopy of Living Cells in Culture ^ Pari B, Methods in Cell Biology ^ 
vol. 30, ed. Taylor, D.L. & Wang, Y.-L., San Diego: Academic Press (1989), pp. 219- 
243; Turro, N.J., Modern Molecular Photochemistry^ Menlo Park: Benjamin/Cummings 
Publishing Col, Inc. (1978), pp. 296-361. 

The excited state lifetime of the donor moiety is, likewise, independent of the absolute 
20 amount of substrate, excitation intensity, or turbidity or other background absorbances. 
Its measurement requires equipment vAih nanosecond time resolution. 

Quantum yields of wild-type GFP, S65T, and P4-1 mutants can be estimated by 
comparison with fluorescein in 0.1 N NaOH as a standard of quantum yield 0.91. J.N. 
Miller, ed.. Standards in Fluorescence Spectrometry^ New York: Chapman and Hall 
25 (1981). Mutants P4 and P4-3 were likewise compared to 9-aminoacridine in water 
(quantum yield 0.98). 
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Any fluorescent protein can be used in the invention, including proteins that fluoresce 
due intramolecular rearrang^ents or ft^ addition of cofactors that promote fluorescence. 
For example, green fluorescent proteins of cnidaiians, which act as their energy-transfer 
acceptors in bioluminescence, are smtable fluorescent proteins for use in the fluorescent 

5 indicators. A green fluorescent protein ("GFP") is a protein that emits green light, and 
a blue fluorescent protein ("BFP") is a protein that emits blue light. GFPs have been 
isolated from the Pacific Northwest jellyfish, ^eqf«orea victoria^ the sea pansy, Renilla 
reniformis, and Phialidium gregarium. See, Ward, W.W., et aly Photochem, PhotobioLy 
35:803-808 (1982); and Levine, L.D., et al, Comp. Biochem, Physiol, 72B:77-85 

10 (1982). 

A variety of iie^worea-related GFPs having usefiil excitation and emission spectra have 
been engineered by modifying the amino acid sequence of a naturally occurring GFP 
fK>mAequorea victoria. See, Prasher, D.C., a/.. Gene, 1 1 1:229-233 (1992); Heim, R., 
etal.Proc, Natl Acad ScL, USA, 91:12501-04 (1994); U.S. Ser. No. 08/337,915, filed 

15 Nov^ber 10, 1994; International application PCT/US95/14692, filed 1 1/10/95; and U.S. 
Ser. No. 08/706,408, filed August 30, 1996. The cDNA of GFP can be concatenated 
with those encoding many other proteins; the resulting fusions often are fluorescent and 
retain the biochemical features of the partner proteins. See, Cubitt, A.B., et al. Trends 
Biochem. ScL 20:448-455 (1995). Mutagenesis studies have produced GFP mutants with 

20 shifted wavelengdis of excitation or emission. See, Heim, R. & Tsien, R.Y. Current 
Biol 6:178-182 (1996). Suitable pairs, for example a blue-shifted GFP mutant P4-3 
(Y66H/Y145F) and an improved green mutant S65T can respectively serve as a donor 
and an acceptor for fluorescence resonance energy transfer (FRET). See, Tsien, R.Y., 
et a/.. Trends Cell Biol 3:242-245 (1993). A fluorescent protein is anide^uorea-related 

25 fluorescent protein if any contiguous sequence of 150 amino acids of the fluorescent 
protein has at least 85% sequence identity with an amino acid sequence, either 
contiguous or non-contiguous, from the wild type Aequorea green fluorescent protein. 
More preferably, a fluorescent protein is an Aequorea-relaXed fluorescent protein if any 
contiguous sequence of 200 amino acids of the fluorescent protein has at least 95% 

30 sequence identity with an amino acid sequence, either contiguous or non-contiguous, 
from the wild typo Aequorea green fluorescent protein. Similarly, the fluorescent protein 
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can be related to Renilla or Phialidium wild-type fluorescent proteins using the same 
standards. Some Aequorea-rtXtiXi^d, engineered versions described in Table I. Other 
variants or mutants are within the scope of the invention as described, for example, in the 
Examples. 



TABLE! 



Clone 


MutaHonfs) 


Exdtaaon max 


Enassion 


Extinction CoefHcknt 


Quantum 






(nm) 


max(nm) 




vield 


WUdtvpe 


none 


395 (475) 


508 


21,000(7,150) 


0.77 


P4 


Y66H 


383 


447 


13,500 


0.21 


P4-3 


Y66H;Y145F 


381 


445 


14,000 


0.38 


W7 


Y66W;N146I 


433 (453) 


475(501) 


18,000(17,100) 


0.67 




M153T 








- 




V163A 












N212K 










W2 


Y66W;I123V 


432 (453) 

* 


480 


10,000 (9,600) 


0.72 




Y145H 












Hi48R 












M153T 












VI 63 A 












N212K 










S6ST 


S65T 


489 


511 


39,200 


0.68 


P4'l 


S65T;M153A 


504 (396) 


514 


14,500 (8,600) 


0.53 




K238E 










S65A 


S65A 


471 


504 






S65C 


S65C 


479 


507 






S6SL 


S65L 


484 


510 






Y66F 


Y66F 


360 


442 






Y66W 


Y66W 


458 


480 






10c 


S65G;V68L 


513 


527 








S72A:T203Y 











10 



15 
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F64L;S65T 

Y66W;N146I 

M153T 

V163A 

N212K 


432 (453) 


476 (503) 




Emerald 


S65T;S72A 
N149K 
MI53T 
I167T 


487 


508 
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An additional clone, WlBl included the following mutations: F64L;S65T; Y66W; F99S; 
andV163A. 



Other fluorescent proteins can be used in the fluorescent indicators, such as, for example, 
5 yellow fluorescent protein fix)m Vibrio fischeri strain Y-1, Peridinin-chlorophyll a 
binding protein from the dinoflagellate Symbiodinitm sp.phycobiliproteins from marine 
cyanobacteria such as Synechococcus^ e,g,, phycoerythrin and phycocyanin, or oat 
phytochromes from oat reconstructed with phycoerythrobilin. These fluorescent proteins 
have be^ described in Baldwin, T.O., et al, Biochemistry 29:5509-55 15 (1990), Morris, 
10 B J., et al„ Plant Molecular Biology, 24:673-677 (1994), and Wilbanks, S.M., et al, 1 
Biol Chem. 268:1226-1235 (1993), and Li et ai, Biochemistry 34:7923-7930 (1995). 

The efficiency of FRET between the donor and acceptor fluorescent protein moieties can 
be adjusted by changing ability of the two fluorescent proteins to closely associate. The 
nature of the binding protein moiety, target peptide moiety, and linicer moiety each affect 

■ 

15 the FRET and the response of the indicator to the analyte. Generally, large 
conformational changes in the binding protein moiety are desired along with a high 
affinity for the target peptide moiety. 



The binding protein moiety is a protein, or part thereof, that changes conformation upon 
binding an analyte. Proteins that imdergo useful conformation change upon binding an 

20 analyte include calmodulin (CaM), cGMP-dependent protein kinase, steroid hormone 
receptors (or ligand binding domain thereof), protein kmase C, inositol- 1,4,5- 
triphosphate receptor, or recoverin. See, for example, Katzenellenbogen, J.A. & 
Katzenellenbogen, B.S. Chemistry & Biology 3:529-536 (1996), and Ames, J.B., et al, 
Curr. OpifL Struct. Biol 6:432-438 (1996). The binding protein moiety preferably binds 

25 target peptides in addition to the analyte. The Ca^^-binding affinity of calmodulin can 
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« 

be tuned as reviewed, for example, in Falke, J J., et al^ QuarL Rev. Biophys, 27:219-290 
(1994). 

The target peptide moiety can contain any of the amino acid sequences in Table 11, or a 
portion thereof with the proviso that the target peptide must bind to the binding protein 

5 moiety. The target peptide can be a subsequence of a calmodulin-binding domain. The 
target peptide moieties listed in Table 11 are recognized by the binding protein moiety 
CaM, See, for example, Crivici, A. & Ikura, M. Amu. Rev, Biophys. Biomol Struct 
24:84-1 16 (1995). The target peptide moiety can be modified to enhance the response 
of the fluorescent indicator to the analyte. Other target peptide moieties are known in the 

10 art for other binding proteins. 
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Target* 

skMLCK(M13) 

smMLCK (smMLCKp) 
S CaMKS 

Caldesmon 

Calspennin 

PFK(M11) 

Calcineurin 
10 PhK(PhK5) 
(PhKB) 

Ca^^-ATPase (C28W) 

59- kDaPDE 

60- kDa PDE 
15 NOS(NO-30) 

1^ I AC (AC-28) 

Borderella pertussis AC 

Neuromodulin 
. Spectrin 
20 MARCKS 

FS2 or MacMARKS 

fi-Adducin 

HSP90a 

HIV-1 gpl60 
25 BBMHBI 

Dilute MHC 

Mastoparan 

Melittin 

Glucagon 
30 Secretin 

VIP 



TABLED 
Sequence 

KRRWKKNFIAVSAANRFKKISSSGAL 
ARRKWQKTGHAVRAIGRLSS 
ARRKLKGAILTTMLATRNFS 

GVKNIKSMWEKGNVFSS 
ARRKLKAAVKAWASSRLGS 
FMNNWEVYKLLAHIRPPAPKSGSYTV 
ARKEVIRNKIRAIGKMARVFSVLR 
LRRLIDAYAFRIYGHWVKKGQQQNRG 
RGKFKVIGLTVLASVRIYYQYRRVKPG 
LRRGQILWFRGLNRIQTQIKWNAFSSS 
RRKHLQRPIFRLRCLVKQLEK 
TEKMWQRLKGILRCLVKQLEK 
KRRAIGFKKLAEAVKFSAKLMGQ 
KPAKRMKFKTVCYLLVQLMHCRKMFKA 
K)LLWKIARAGARSAVGTEA 
KAHKAATKIQASFRGHITRKKLKGEKK 
KTASPWKSARLMVHTVATFNSKE 

KKKKKRFSFKKSFKLSGFSFKKSKK 

■ 

• KKKKKFSFKKPFKLSGLSFKKNRK 

t 

KQQKEKTRWDmWYUlVNVADEVQRNMGS 
KDQVANSAFQERLRKHGLEVI 
YHRLRDLLLIVKRIVELLGRR 
QQLATLIQKTYRGWRCRTHYQLM 
RAACIRIQKTIRGWLLRKRYLCMQ 
INLKALAALAKKIL 
GIGAVLKVLTTGLPALISWIKRKRQQ 
HSQGTFTTSDYSKYLDSRRAQDFVQWLMNT 
HSDGTFTSELSRLRDSARLQRLLQGLV 

HSDAVFTDNYTRLRKQMAVKKYLNSILN 
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GIP 



YADGTHSDYSAIMNKIRQQDFVNWLLAQQQKS 



Model Peptide CBP2 



KLWKKLLKLLKKLLKLG 



' Abbreviations: AC, adenylyl cyclase; BBMHCI, brush-border myosin heavy chain-I; 
CaMKII, calmodulin kinase II; CBP2, calmodulin binding peptide-2; GIP, gastrin 
5 inhibitory peptide; HTV- 1 gpl 60, human inmiunodeficiency virus envelope glycoprotein 
160; HSP, heat-shock protein; MARCKS, myiistoylated alaminte-rich C kinase substrate; 



phosphofiructokinase; PhK, phosphotylase kinase; sk-, smMLCK, skeletal muscle- and 
smooth muscle-myosin light chain kinase; VIP, vasoactive intestinal peptide. 

10 The length of the linker moiety is chosen to optimize both FRET and the kinetics and 
specificity of conformational changes induced by analyte binding. The linker moiety 
should be long enough and flexible enough to allow the binding protein moiety and target 
peptide moiety to freely interact and respond to analyte concentration. In order to 
optimize the FRET effect, the average distance between the donor and acceptor 

15 fluorescent protein moieties should become between about 1 nm and about 10 nm, 
preferably between about 1 nm and about 6 nm, and more preferably between about 1 nm 
and about 4 nm, when the analyte is boimd (or released). If the linker is too short or too 
stiff, the donor and acceptor protein moieties cannot readily change position. If the 
linker moiety is too long, the target peptide moiety might not bind to the binding protein 

20 moiety effectively. The linker moiety is, preferably, a peptide moiety. The preferred 
linker moiety is a peptide between about one and 30 amino acid residues in length, 
preferably between about two and 1 5 amino acid residues. One preferred linker moiety 
is a -Gly-Gly- linker. 

The linker moiety can include flexible spacer amino acid sequences, such as those known 



25 in single-chain antibody research. For example, the linker moiety can be GGGGS 
(GGGGS)„, GKSSGSGSESKS, GSTSGSGKSSEGKG, GSTSGSGKSSEGSGSTKG, 
GSTSGSGKSSEGKG,GSTSGSGKPGSGEGSTiiCG,or EGKSSGSGSESKEF. Unking 
moieties are described, for example, in Huston, J.S., et al.PNAS 85:5879-5883 (1988), 



MHC, myosin heavy cham; NOS, nitric oxide synthase; PDE, phosphodiesterase; PFK, 
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Whitlow, M., et al. Protein Engineering 6:989-995 (1993), and Newton, D.L., et al. 
Biochemistry 35:545-553 (1996). 

Another embodiment of the invention includes a nucleic acid molecule, comprising a 
polynucleotide that encodes a fluorescent protein energy transfer partner, comprising: a 
5 first chemical entity bmding region that binds to a chemical entity and binds to a second 
protein partner, and a fluorescent protein in frame with the first chemical entity binding 
region; wherein the fluorescent protein is a first energy transfer partner for a second 
energy transfer partner attached to the second protein partner. Typically the fluorescent 
protein is an Aequorea-xQXd^&A fluorescent protein., 

10 The first chemical entity binding region can comprise a bound conformation of smaller 
volume when the chemical entity is bound to the first chemical entity binding region 
compared to the first chemical entity binding region when the ch^cal entity is not 
bound to the first chemical entity binding region. The bound conformation can permit 
increased energy transfer between the first energy transfer partner and the second energy 

1 5 transfer partner. 



The bound conformation may also increase the afiSnity between the first chemical entity 
binding region and the second protein partner. Often the first chemical entity binding 
region binds to a protein association region of the second protein partner. In many cases 
the chemical entity is an analyte, as described herein. The chemical entity could also 
20 include other entities such as cellular proteins. 

Preferably, the first chemical entity binding region is either cahnodulin, a calmodulin- 
related protein moiety, cGMP-dependent protein kinase, a steroid hormone receptor, a 
ligand binding domain of a steroid hormone receptor, protein kinase C, inositol-l ,4,5- 
triphosphate receptor, or recoverin. More preferably, the first chemical entity binding 
25 region is calmodulin or a calmodulin-related protein. 
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In some aspects of this embodiment of the invention the fluorescent protein energy 
transfer partner further comprises a second energy transfer partner in frame with the first 
energy transfer partner. Such embodiments can be produced by molecular biology 
techniques as opposed to protein synthesis techniques. Proteins can be produced by 
5 either technique to produce fluorescent protein that act as energy transfer partners. For 
example, the first energy transfer partner is either P4-3, EBFP, or WIB, and the second 
energy transfer partner is either S65T, EGFP, or 10c. 

In addition energy partners can be generated using labelling methods known in the art 
for tagging proteins with fluorescent labels (e.g., biochemical reagents and non-protein 
10 fluorescent tags). 

The fluorescent protein energy transfer partner can also further comprise a second protein 
partner comprising the second energy transfer partner in frame with the first energy 
transfer partner. The second protein partner can also further comprises a protein 
association region. 



15 The second protein partner is eiUier a calmodulin-binding domain of skMLCKp, 
smMLCK, CaMKn, Caldesmon, Calspermin, phosphofinctokinase calcineurin, 
phosphoiylase kinase, Ca2+-ATPase 59 kDa PDE, 60 kDa PDE, nitric oxide synthase, 
type I adenylyl cylclase, Bordetella pertussis adenylyl cyclase, Neuromodulin, Spectrin, 
MARCKS, F52, B-Adducin, HSP90a, HIV4 gpl60, BBMHBI, Dilute MHC, M- 

20 astoparan, Melittin, Glucagon, Secretin, VIP, GIP, or Model Peptide CBP2. Preferably, 
the protein association region is Ml 3. The invention can include an amino acid linker 
in frame with the first chemical entity binding region and fusing the protein association 
region with the first chemical entity binding region. The invention can also include a 
fluorescent protein energy transfer partner that further comprises a localization sequence. 
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Another aspect of the present invention is a nucleic acid molecule, comprising a 
polynucleotide that encodes l)a first fluorescent protein energy transfer partner, 
comprising: a chemical entity binding region tiiat binds to a chemical entity and binds 
to a second fluorescent protein energy transfer partner, and a first fluorescent protein in 

5 frame with the first chemical entity binding region; and 2) the second fluorescent protein 
energy transfer partner, comprising: a protein association region that binds to the 
chemical entity binding region, and a second fluorescent protein in firame with the protein 
association region; wherein the first fluorescent protein and the second fluorescent 
protein are energy transfer partners and the first fluorescent protein energy transfer 

10 partner is in fi^e with the second first fluorescent protein energy transfer partner. The 
first and second fluorescent proteins can be Aequorea-relsiiGd fluorescent proteins. 

A further aspect of the present invention is this nucleic acid molecule, wherem the 
chemical entity binding re^on comprises a bound conformation of smaller volume when 
the chemical entity is bound to the chemical entity binding region compared to the 

15 chemical entity binding region when the chemical entity is not bound to the chemical 
entity binding region and the bound conformation permits increased energy transfer 
between the first fluorescent protein energy transfer partner and the second fluorescent 
protein energy transfer partner. Another aspect of the present invention is this nucleic 
acid, wherein the bound conformation increases the affinity between the chemical entity 

20 binding region and the second fluorescent protein energy transfer partner. The chemical 
entity is an analyte. 

Another aspect of the present invention is a system for monitoring protein-protein 
association, comprising: 1) a first fluorescent protein energy transfer partner, comprising: 
a chemical entity binding region that binds to a chemical entity and binds to a second 
25 fluorescent protein energy transfer partner, and a fust energy transfer partner, and 2) the 
second fluorescent protein energy transfer partner, comprising: a protein association 
region that binds to the chemical entity binding region, and a second energy transfer 
partner; wherein the first energy transfer partner and the energy transfer partner are 
energy transfer partners. 



WO9840477 [http://www.getthepateiil I ilii|j^fc|iiik99/FetchAA/O9840477xpc?fromCache=1part=mai^>toaR 




WO i98/40477 PCT/US98/04978 



-26- 

The fluorescent indicators can also include a localization sequence to direct the indicator 
to particular cellular sites by fusion to appropriate organellar targeting signals or 
localized host proteins. A polynucleotide encoding a localization sequence^ or signal 
sequence, can be ligated or fused at the 5' terminus of a polynucleotide encoding the 

5 fluorescence indicator such that the signal peptide is located at the amino terminal end 
of the resulting fusion polynucleotide/polypeptide. In the case of eukaryotes, the signal 
peptide is believed to fimction to transport the fiision polypeptide across the endoplasmic 
reticulum. The secretory protein is then transported through the Golgi apparatus, into 
secretory vesicles and into the extracellular space or, preferably, the external 

10 environment. Signal peptides which can be utilized according to the invention include 
pre-pro peptides which contain a proteolytic enzyme recognition site. Other signal 
peptides with similar properties to pro-calcitonin described herein are known to those 
skilled in the art, or can be readily ascertained without undue experimentation. 

The localization sequence can be a nuclear localization sequence, an endoplasmic 
15 reticulum localization sequence, a peroxisome localization sequence, a mitochondrial 
localization sequence, or a localized protein. Localization sequences can be targeting 
sequences v^ch are desmbed, for example, in "Protein Targeting", chapter 35 of Sttyer, 
L., Biochemistry (4th ed.). W.H. Freeman, 1995. The localization sequence can also be 
a localized protein. Some important localization sequences include those targeting the 
20 nucleus (KKKRK), mitochondrion (amino terminal 
MUlTSSLFTRRVQPSIi^lNILRLQST-), endoplasmic reticulum (KDEL at C-terminus, 
assuming a signal sequence present at N-terminus), peroxisome (SKF at C-terminus), 
prenylation or insertion into plasma membrane (CaaX, CC, CXC, or CCXX at C- 
terminus), cytoplasmic side of plasma membrane (fusion to SNAP-25), or the Golgi 
25 apparatus (fusion to furin). 

The fluorescent indicators can be produced as fusion proteins by recombinant DNA 
technology. Recombinant production of fluorescent proteins involves expressing nucleic 
acids having sequences that encode the proteins. Nucleic acids encoding fluorescent 
proteins can be obtained by methods known in the art. For example, a nucleic acid 
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encoding the protein can be isolated by polymerase chain reaction of cDNA from A. 
victoria using primers based on the DNA sequence of A, victoria green fluorescent 
protein. PGR methods are described in, for example, U.S. Pat. No. 4,683,195; Mullis, 
et aL Cold Spring Harbor Symp, Quant. Biol 51:263 (1987), and Erlich, ed., PCR 
5 Technologyy (Stockton Press, NY, 1 989). Mutant versions of fluorescent proteins can be 
made by site-specific mutagenesis of other nucleic acids encoding fluorescent proteins. 



or by random mutagenesis caused by increasing the error rate of PCR of the original 
polynucleotide with 0.1 mM MnCl2 and unbalanced nucleotide concentrations. See, e.g., 
U.S. patent application 08/337,915, filed November 10, 1994 or International application 
10 PCT/US95/14692, filed 1 1/10/95. 

The . construction of expression vectors and the expression of genes in transfected cells 
involves the use of molecular cloning techniques also well known in the art. Sambrook 
et al.f Molecular Cloning - A Laboratory Manual ^ Cold Spring Harbor Laboratory, Cold 
Spring Harbor, NY, (1989) and Current Protocols in Molecular Biology, F.M. Ausubel 
15 et aU eds., (Current Protocols, a joint venture between Greene Publishing Associates, 
Inc. and John Wiley & Sons, Inc., most recent Supplement). 

Nucleic acids used to transfect cells with sequences coding for expression of the 
polypeptide of interest generally will be in the formi of an expression vector including 
expression control sequences operatively linked to a nucleotide sequence coding for 

20 expression of the polypeptide. As used, the term "nucleotide sequence coding for 
e?q)ression or a polypeptide refers to a sequence that, upon transcription and translation 
of mRNA, produces the polypeptide. This can include sequences containing, e.g., 
introns. As used herein, the teim "expression control sequences" refers to nucleic acid 
sequences that regulate the expression of a nucleic acid sequence to which it is 

25 operatively linked Expression control sequences are operatively linked to a nucleic acid 
sequence when the expression control sequences control and regulate the transcription 
and, as appropriate, translation of the nucleic acid sequence. Thus, expression control 
sequences can include appropriate promoters, enhancers, transcription terminators, a start 
codon (z.e., ATG) in fi-ont of a protein-encoding gene, splicing signals for introns, 
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maintoiance of the correct reading frame of that gene to permit proper translation of the 
niRNA, and stop codons. 

Methods which are well known to those skilled in the art can be used to construct 
ejqjression vectors containing the fluorescent indicator coding sequence and appropriate 
5 transcriptional/translational control signals. These methods include in vitro recombinant 
DNA techniques, synthetic techniques and in vivo recombination/genetic recombination. 
(See, for example, the techniques described in Maniatis, et al, Molecular Cloning A 
Laboratory ManuaU Cold Spring Harbor Laboratory, N.Y., 1 989). 

Transformation of a host cell with recombinant DNA may be carried out by conventional 
10 techniques as are well known to those skilled in the art. Where the host is prokaryotic, 
such as£ co/i, competent cells which are capable of DNA uptake can be prepared from 
cells harvested after exponential growth phase and subsequently treated by the CaCl2 
method by procedures well known in the art. Alternatively, MgCl2 or RbCl can be used. 
Transformation can also be performed after forming a protoplast of the host cell or by 
IS electroporation. 

When the host is a eukaryote, such methods of transfection of DNA as calcium phosphate 
co-precipitates, conventional mechanical procedures such as microinjection, 
electroporation, insertion of a plasmid encased in liposomes, or virus vectors may be 
used. Eukaryotic cells can also be cotransfected with DNA sequences encoding the 

20 fusion polypeptide of the invention, and a second foreign DNA molecule encoding a 
selectable phenotype, such as the herpes simplex thymidine kinase gene. Another 
method is to use a eukaryotic viral vector, such as simian virus 40 (SV40) or bovine 
papilloma virus, to transiently infect or transform eukaryotic cells and express the 
protein. {Eukaryotic Viral Vectors, Cold Spring Harbor Laboratory, Gluzman ed., 1982). 

25 Preferably, a eukaryotic host is utilized as the host cell as described herein. 
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Techniques for the isolation and purification of either microbially or eukaryotically 
expressed polypeptides of the invention may be by any conventional means such as, for 
example, preparative chromatographic separations and immunological separations such 
as those involving the use of monoclonal or polyclonal antibodies or antigen. 

5 A variety of host-expression vector systems may be utilized to express fluorescent 
indicator coding sequence. These include but are not limited to microorganisms such as 
bacteria transformed with recombinant bacteriophage DNA, plasmid DNA or cosmid 
DNA expression vectors containing a fluorescent indicator coding sequence; yeast 
transformed with recombinant yeast expression vectors containing the fluorescent 

10 indicator coding sequence; plant cell systems infected with recombinant virus expression 
vectors (^.g., cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) or 
transformed with recombinant plasmid expression vectors (eg., Ti plasmid) containing 
a fluorescent indicator coding sequence; insect cell systems infected with recombinant 
virus expression vectors baculovinis) containing a fluorescent indicator coding* 

15 sequence; or animal cell systems infected with recombinant virus expression vectors 
(e.g., retroviruses, adenovirus, vaccinia virus) containing a fluorescent indicator coding 
sequence, or transformed animal cell systems engineered for stable expression. 

Depending on the hostA^ector system utilized, any of a number of suitable transcription 
and translation elements, including constitutive and inducible promoters, transcription 

20 enhancer elements, transcription terminators, etc. may be used in the expression vector 
(sec, e,g. , Bitter, et al , Methods in Enzymology 1 53 :5 1 6-544, 1 987). For example, when 
cloning in bacterial systems, inducible promoters such as pL of bacteriophage S, plac, 
ptrp, ptac (ptrp-lac hybrid promoter) and the like may be used. When cloning in 
mammalian cell systems, promoters derived firom the genome of mammalian cells (e.g., 

25 metallothionein promoter) or from mammalian viruses (e.g. , the retrovirus long terminal 
repeat; the adenovirus late promoter; the vaccinia virus 7.5K promoter) may be used. 
Promoters produced by recombinant DNA or synthetic techniques may also be used to 
provide for transcription of the inserted fluorescent indicator coding sequence. 
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In bacterial systems a number of expression vectors may be advantageously selected 
depending upon the use intended for the fluorescent indicator expressed. For example, 
when laige quantities of the fluoresc^t indicator are to be produced, vectors which direct 
the expression of high levels of fusion protein products that are readily purified may be 
5 desirable. Those which are engineered to contain a cleavage site to aid in recovering 
fluorescent indicator are preferred. 

In yeast, a number of vectors containing constitutive or inducible promoters may be used. 
For a review see, Current Protocols in Molecular Biology, Vol. 2, Ed. Ausubel, et aL, 
Greene Publish. Assoc. & Wiley Interscience, Ch. 13, 1988; Grant, et aL, Expression and 
k 10 Secretion Vectors for Yeast, in Methods in Enzymology, Eds. Wu & Grossman, 3 1987, 

Acad. Press, N.Y., Vol. 153, pp.516-544, 1987; Glover, DNA Cloning, Vol. n, IRL 
Press, Wash., D.C., Ch. 3, 1986; and Bitter, Heterologous Gene Expression in Yeast, 
Methods in Enzymology, Eds. Berger & Kimmel, Acad. Press, N.Y., Vol 152, pp. 673- 
684, 1 987; and The Molecular Biology of the Yeast Saccharomyces, Eds. Strathem et aL , 
15 Cold Spring Harbor Press, Vols. I and II, 1982. A constitutive yeast promoter such as 
ADH or LEU2 or an inducible promoter such as GAL may be used (Cloning in Yeast, 
Ch. 3, R. Rothstem In: DNA Cloning VoLll, A Practical Approach, Ed. DM Glover, IRL 
Press, Wash., D.C., 1986). Alternatively, vectors may be used which promote integration 
of foreign DNA sequences into the yeast chromosome. 



20 In cases wheie plant expression vectors are used, the expression of a fluorescent indicator 
coding sequence may be driven by any of a nimiber of promoters. For example, viral 
promoters such as the 35S RNA and 19S RNA promoters of CaMV (Brisson, et aL, 
Nature 3 10:5 11 -5 14, 1984), or the coat protein promoter to TMV (Takamatsu, et aL, 
EhfBOJ. 6:307-31 1, 1987) may be used; alternatively, plant promoters such as the small 

25 subunit of RUBISCO (Coruzzi, et aL, 1984, EMBO J. 3:1671-1680; Broglie, et aL, 
Science 224:838-843, 1984); or heat shock promoters, e.g., soybean hspl7.5-E or 
hspl7.3-B (Guriey, et aL, MoL CelL BioL 6:559-565, 1986) may be used. These 
constructs can be introduced into plant cells using Ti plasmids, Ri plasmids, plant virus 
vectors. 
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4 

direct DNA transformation, microinjection, electroporation, etc. For reviews of such 
techniques see, for example, Weissbach & Weissbach, Methods for Plant Molecular 
Biology, Academic Press, NY, Section Vm, pp. 421-463, 1988; andGrierson & Corey, 

* 

Plant Molecular Biology, 2d Ed., Blackie, London, Ch. 7-9, 1988. 



An altemative expression system which could be used to express fluorescent indicator 
is an insect system. In one such system, Autographa californica nuclear polyhedrosis 



Spodopterafrugiperda cells. The fluorescent indicator coding sequence may be cloned 
into non-essential regions (for example, the polyhedrin gene) of the virus and placed 

10 under control of an AcNPV promoter (for example the polyhedrin promoter). Successful 
insertion of the fluorescent indicator coding sequence will result in inactivation of the 
polyhedrin gene and production of non-occluded recombinant virus (i.e., vims lacking 
the proteinaceous coat coded for by the polyhedrin gene). These recombinant viruses are 
then used to infect Spodoptera frugiperda cells in which the inserted gene is expressed, 

15 see Smith, etal, J. Viol. 46:584, 1983; Smith, U.S. Patent No. 4,215,051; 

Eukaiyotic systems, and preferably mammalian expression systems, allow for proper 
post-translational modifications of expressed mammalian proteins to occur. Eukaryotic 
cells which possess the cellular machinery for proper processing of the primary 
transcript, glycosylation, phosphorylation, and, advantageously secretion of the gene 
20 product should be used as host cells for the expression of fluorescent indicator. Such 



MDCK, Jurkat, HEK-293, and WD8. 

Mammalian cell systems which utilize recombinant viruses or viral elements to direct 
expression may be engineered. For example, when using adenovirus expression vectors, 
25 the fluorescent indicator coding sequence may be ligated to an adenovirus 
transcription/translation control complex, e.g., the late promoter and tripartite leader 
sequence. This chimeric gene may then be inserted in the adenovirus genome by in vitro 
or in vivo recombination. Insertion in a non-essential region of the viral genome (e.g., 
region El or E3) will result in a recombinant vmis that is viable and capable of 



virus (AcNPV) is used as a vector to express foreign genes. The virus grows in ! 

1 



I 



host cell lines miay include but are not limited to CHO, VERO, BHK, HeLa, COS, | 
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expressing the fluorescent indicator in infected hosts {e,g.^ see Logan & Shenk, Proc. 
Natl. Acad Set USA, 81: 3655-3659, 1984). Alternatively, the vaccmia virus 7.5K 
promotCT may be used. (e.g., see, Mackett, et al, Proc Natl Acad. ScL USA ,79: 7415- 
7419, 1982; Mackett, al.. J. Virol. 49: 857-864, 1984; Panicali, et al, Proc. Natl. Acad. 

5 Sci. USA 79: 4927-4931, 1982). Of particular interest are vectors based on bovine 
papilloma virus which have the ability to replicate as extrachromosomal elements 
(Sarver, et al., Mol Cell Biol 1 : 486, 1981). Shortly after entry of this DNA into mouse 
cells, the plasmid replicates to about 100 to 200 copies per cell. Transcription of the 
inserted cDNA does not require integration of the plasmid into the host's chromosome, 

10 thereby yielding a high level of expression. These vectors can be used for stable 
expression by including a selectable marker in the plasmid, such as the neo gene. 
Alternatively, the retroviral genome can be modified for use as a vector capable of 
introducing and directing the expression of the fluorescent indicator gene in host cells 
(Cone & Mulligan, Proc. Natl Acad ScL USA, 81:6349-6353, 1984). High level 

15 expression may also be achieved using mducible promoters, including, but not limited 
to, the metallothionine IIA promoter and heat shock promoters. 

For long-term, high-yield production of recombinant proteins, stable expression is 
preferred. Rather than using expression vectors which contain viral origins of 
r^licadon, host cells can be transfomied with the fluorescent indicator cDNA controlled 

20 by appropriate expression control elements (e.g., promoter, enhancer, sequences, 
transcription terminators, polyadenylation sites, etc.), and a selectable marker. The 
selectable marker in the recombinant plasmid confers resistance to the selection and 
allows cells to stably integrate the plasmid into their chromosomes and grow to form foci 
which in turn can be cloned and expanded into cell lines. For example, following the 

25 introduction of foreign DNA, engineered cells may be allowed to grow for 1-2 days in 
an enriched media, and then are switched to a selective media. A number of selection 
systems may be used, including but not limited to the herpes simplex virus thymidine 
kinase (Wigler, et al. Cell 11: 223, 1977), hypoxanthine-guanine 
phosphoribosfyltransferase (Szybalska & Szybalski, Proc. Natl Acad ScL USA, 48:2026, 

30 1962), and adenine phosphoribosyltransferase (Lowy, et al. Cell, 22: 817, 1980) genes 
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can be employed in tk*, hgprt" or aprt" cells respectively. Also, antimetabolite resistance 
can be used as the basis of selection for dhfr, which confers resistance to methotrexate 
(Wigler, et al, Proc. Natl Acad ScL USA, 77: 3567, 1980; O^Hare, et al, Proc, Natl 
Acad. ScL USA, 8: 1527, 1981); gpt, which confers resistance to mycophenolic acid 

5 (Mulligan & Berg, Proc. Natl Acad ScL USA, 78: 2072, 1981; neo, which confers 
resistance to the aminoglycoside G-418 (Colberre-Garapin, et al, J. Mol Biol, 150:1, 
1981); and hygro, which confers resistance to hygromycin (Santerre, et al. Gene, 30: 
147, 1984) genes. Recently, additional selectable genes have been described, namely 
trpB, which allows cells to utUize indole in place of tryptophan; hisD, which allows cells 

10 to utilize histinol in place of histidine (Hartman & Mulligan, Proc. Natl Acad ScL USA, 
85:8047, 1988); and ODC (ornithine decarboxylase) which confers resistance to the 
ornithine decarboxylase inhibitor, 2-(difluoromethyl)-DL-omithine, DFMO 
(McConlogue L., In: Current Communications in Molecular Biology, Cold Spring 
Harbor Laboratory, ed., 1987). 

15 DNA sequences encoding the fluorescence indicator polypeptide of the invention can be 
expressed in vitro by DNA transfer into a suitable host cell. "Host cells" are cells in 
which a vector can be propagated and its DNA expressed. The term also includes any 
progeny of the subject host cell. It is understood that all progeny may not be identical 
to the parental cell since there may be mutations that occur during replication. However, 

20 such progeny are included when the term "host cell" is used. Methods of stable transfer, 
in other words when the foreign DNA is continuously maintained in the host, are known 
in the art. 

Recombinant fluorescent protein can be produced by expression of nucleic acid encoding 
the protein in prokaryotes, such as K coli or in eukaiyotes, such as yeast cells or 
25 mammalian cells. The fluorophore of Aequorea-relsitM fluorescent proteins results from 
cyclization and oxidation of residues 65-67. 
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The construct can also contain a tag to simplify isolation of the fluorescent indicator. For 
example, a polyhistidine tag of, e.g., six histidine residues, can be incorporated at the 
amino terminal end of the jQuorescent protein. The polyhistidine tag allows convenient 
isolation of the protein in a single step by nickel-chelate chromatography. 

5 In a preferred embodiment, the fluorescent indicator is a fusion protein produced by 
recombinant DNA technology in which a single polypeptide includes a donor moiety, a 
peptide linker moiety and an acceptor moiety. The donor moiety can be positioned at the 
amino-tenninus relative to the acceptor moiety m the polypeptide. Such a fusion protein 
has the generalized structure: (amino terminus) donor fluorescent protein moiety 

10 -peptide linker moiety -acceptor fluorescent protein moiety (carboxy terminus). 
Alternatively, the donor moiety can be positioned at the carboxy-terminus relative to the 
acceptor moiety within the fusion protein. Such a fusion protein has the generalized 
structure: (amino terminus) acceptor fluorescent protein moiety -peptide linker moiety 
-donor fluorescent protein moiety (carboxy terminus). The invention also envisions 

15 fusion proteins that contain extra amino acid seqiiences at the amino and/or carboxy 
termini, for example, polyhistidine tags. 

Thus, fluorescent indicators encoded by a recombinant nucleic acid include sequences 
coding for expression of a donor fluorescent protein moiety, an acceptor fluorescent 
protein moiety and a peptide linker moiety. The elements are selected so that upon 

20 e?q>ression into a fusion protein, the donor and acceptor moieties exhibit FRET when the 
donor moiety is excited. The recombinant nucleic acid can be incorporated into an 
expression vector comprising expression control sequences operatively linked to the 
recombinant nucleic acid. The expression vector can be adapted for function in 
prokaryotes or eukaryotes by inclusion of appropriate promoters, replication sequences, 

25 markers, etc. 

The expression vector can be transfected into a host cell for expression of the 
recombinant nucleic acid. Host cells can be selected for high levels of expression in 
order to purify the fluorescent indicator fusion protein. E. coli is useful for this purpose. 
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Altematively, the host cell can be a prokaiyotic or eukaryotic cell selected to study the 
activity of an enzyme produced by the cell. In this case, the linker peptide is selected to 
include an amino acid sequence recognized by the protease. The cell can be, e.g., a 
cultured cell or a cell in vivo. 



In another embodiment, the invention provides a transgenic non-human animal that 
expresses a nucleic acid sequence which encodes the fluorescent indicator. 

The *'non-human animals" of the invention comprise any non-human animal having 
15 nucleic acid sequence which encodes a fluorescent indicator. Such non-human animals 
include vertebrates such as rodents, non-human primates, sheep, dog, cow, pig, 
amphibians, and reptiles. Preferred non-human animals are selected from the rodent 
family including rat and mouse, most preferably mouse. The "transgenic non-human 
animals" of the invention are produced by introducing "transgenes" into the germline of 
20 the non-human animal. Embryonal target cells at various developmental stages can be 
used to introduce transgenes. Different methods are used depending on the stage of 
development of the embryonal target cell. The zygote is the best target for 
micro-injection. In the mouse, the male pronucleus reaches the size of approximately 20 
micrometers in diameter which allows reproducible injection of 1-2 pi of DNA solution. 
25 The use of zygotes as a target for gene transfer has a major advantage in that in most 
cases tiie injected DNA will be incorporated into the host gene before the first cleavage 
(Brinster et al, Proc. Natl Acad. ScL USA 82:4438-4442, 1985). As a consequence, all 
cells of the transgenic non-human animal will carry the incorporated transgene. This will 



5 A primary advantage of fluorescent indicator fusion proteins is that tiiey are prepared by ! 
normal protein biosynthesis, thus completely avoiding organic synthesis and the 

I 

requirement for customized urmatural amino acid analogs. The constructs can be ] 
expressed in E. coli in large scale for in vitro assays. Purification from bacteria is | 
simplified when the sequences include polyhistidine tags for one-step purification by f 
10 nickel-chelate chromatography. Altematiyely, the substrates can be expressed directly 
in a desired host cell for assays in situ. 
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in general also be reflected in the efficient transmission of the transgene to offspring of 
the founder since 50% of the germ cells will harbor the transgene. Microinjection of 
zygotes is the preferred method for incorporating transgenes in practicing the invention. 

• 

The term "transgenic" is used to describe an animal which includes exogenous genetic 
5 material within all of its cells. A "transgenic" animal can be produced by cross-breeding 
two chimeric animals which include exogenous genetic material within cells used in 
reproduction. Twenty-five percent of the resulting offspring will be transgenic Le,, 
animals which include the exogenous genetic material within all of their cells in both 
alleles. 50% of the resulting animals will include the exogenous genetic material within 
10 one allele and 25% will include no exogenous genetic material. 

Retroviral mfection can also be used to introduce transgene into a non-human animal. 
The developing non-human embryo can be cultured in vitro to the blastocyst stage. 
During this time, the blastomeres can be targets for retro viral infection (Jaenich, R., 
Proc. Natl Acad Sci USA 73:1260-1264, 1976). EfiBcient infection of the blastomeres 

15 is obtained by enzymatic treatment to remove the zona pellucida (Hogan, et al (1986) 
in Manipulating the Mouse Embryo, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, N.Y.). The viral vector system used to introduce the transgene is typically a rep- 
lication-defective retro virus carrying the transgene (Jahner, et aL, Proc. Natl Acad, Sci 
USA 82:6927-6931, 1985; Van der Putten, et al. Proa Natl Acad Sci USA 

20 82:6148-6152, 1985). Transfection is easily and efiSciently obtained by culturing the 
blastomeres on a monolayer of virus-producmg cells (Van der Putten, supra; Stewart, et 
al.yEMBOJ. 6:383-388, 1987). Alternatively, infection can be performed at a later stage. 
Virus or virus-producing cells can be injected into the blastocoele (D. Jahner et al. 
Nature 298:623-628, 1982). Most of the founders will be mosaic for the transgene since 

25 mcorporation occurs only in a subset of the cells which formed the transgenic nonhuman 
animal. Further, the founder may contain various retro viral insertions of the transgene 
at different positions m the genome which generally will segregate in the offspring. In 
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addition, it is also possible to introduce transgenes into the germ line,. albeit with low 
efiSciency, by intrauterine retro viral infection of the midgestation embryo (D. Jahner et 
al^ supra), 

A third tyrpe of target cell for transgene introduction is the embryonal stem cell (ES). ES 
5 cells are obtained from pre-implantation embryos cultured in vitro and fused with 
embryos (M. J. Evans et al Nature 292:154-156, 1981; M.O. Bradley et al. Nature 309: 
255-258, 1984; Gossler, et al, Proc, NatL Acad Sci USA 83: 9065-9069, 1986; and 
Robertson et al. Nature 322:445-448, 1986): Transgenes can be efficiently introduced 
into the ES cells by DNA transfection or by retro virus-mediated transduction. Sudi 
10 transformed ES cells can thereafter be combined with blastocysts fiom a nonhuman 
animal. The ES cells thereafter colonize the embryo and contribute to the germ line of 
the resulting chimeric animal. (For review see Jaenisch, R., Science 240: 1468-1474, 
1988). 

« 

"Transformed" means a cell into which (or into an ancestor of which) has been 
1 5 introduced, by means of recombinant nucleic acid techniques, a heterologous nucleic acid 
molecule. "Heterologous" refers to a nucleic acid sequence that either originates from 
another species or is modified fiom either its original form or the form primarily 
expressed in the cell. 

"Transgene" means any piece of DNA which is mserted by artifice into a cell, and 
20 becomes part of the genome of the organism (/.e., either stably integrated or as a stable 
extrachromosomal element) which develops from that cell. Such a transgene may 
include a gene which is partly or entirely heterologous (i.e., foreign) to the transgenic 
organism, or may represent a gene homologous to an endogenous gene of the organism. 
Included within this definition is a transgene created by the providing of an RNA 
25 sequence which is transcribed into DNA and then incorporated into the genome. The 
transgenes of the invention include DNA sequences which encode which encodes the 
fluorescent indicator which may be expressed in a transgenic non-human animal. The 
term "transgenic" as used herein additionally includes any organism whose genome has 
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been altered by in vitro manipulation of the early embryo or fertilized egg or by any 



transgenic tedmology to induce a specific gene knockout. The term "gene knockout" as 
used herein, refers to the targeted disruption of a gene in vivo with complete loss of 
fimction that has been achieved by any transgenic technology familiar to those in the art. 
5 In one embodiment, transgenic animals having gene knockouts are those in which the 
target gene has been rendered nonfunctional by an insertion targeted to the gene to be 
rendered non-functional by homologous recombination. As used herein, the term 
"transgenic" includes any transgenic technology femiliar to those in the art which can 
produce an organism carrying an introduced transgene or one in which an endogenous 



10 gene has been rendered non-functional or "knocked out." 

Without further elaboration, it is believed that one skilled in the art can, based on the 
description herein, utilize the present invention to its fullest extent All patents and 
publications cited herein are hereby incorporated by reference. A fluorescent indicator 



15 ProLEngineering 7:109-1 15 (1994), between a blue (P4-3) and a green (S65T) GFP 
mutant, as illustrated in FIG. 2a. The chimeric cDNA was cloned at BamHUEcoRI sites 
of pRSETB (Invitrogen), HIS is the amino-terminal tag peptide containing the 
polyhistidine sequence and XCaM is Xenopus calmodulin. Chimeric proteins 
incorporating a polyhistidine tag were expressed in Escherichia coli, purified by 

20 nickel-chelate and size-exclusion chromatography, and their fluorescence characterized. 
Referring to FIG. 2a, the fluorescent CaM-based indicator ("cameleon-l") readily 
changes emission color by retracting and extaiding a long tongue (Ml 3) into and out of 
the mouth of the CaM. 



25 GFPs can be important to optimize protein folding and the Ca^^-sensitivity of FRET. 
One particularly sensitive indicator is shown in FIG. 2b. Referring to FIG. 2b, the amino 
acid and nucleotide sequences of the boundaries between P4-3 and CaM and between 
Ml 3 and S65T in cameleon-1 are shown. Cameleon-1 has an 1 1 amino acid deletion at 
the C-terminus of P4-3 and a 5 amino acid deletion at the C-terminus of M13. Two 



forCa^" 



was produced by sandwiching CaM-M13 fiision, described in Porumb, T., et al 



The amino acid composition of the boundary regions between the CaM-M13 hybrid and 
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restriction sites Sphl and Sacl are underlined, which were utilized to connect the genes 
of P4-3 and S65T to the CaM-M13 gene, respectively. To facilitate the folding of the 
GFP that is fused to any other protein, a few glycine residues are usually inserted into the 
boundary. See, for example, Porumb, T., et al, Prol Engineering 7: 109-1 15 (1994). 
5 Further, glycine residues were not introduced so that P4-3 and S65T were fused rigidly 
to the CaM and Ml 3, respectively. The rigid fusion leads to more effective transduction 
of the conformational change of CaM-M13, causing a greater change in FRET ejEBciency. 

The fluorescent indicator was efficiently expressed, and folded in bacteria and increased 
its ratio of UV-excited 510 nm to 445 nm emissions by 70% upon binding Ca^*, as shown 

10 in FIG. 3. The emission spectra of cameleon-1 were measured in 1 00 mM KCl, 20 mM 
MOPS, 1 mM EDTA, KOH to pH 7.20, before and after addition of 2 mM CaClj to give 
1 mM fiee Ca^. The Ca^^ binding to EDTA caused a local acidification of the solution, 
and a small fraction of the protein was denatured. Thus the spectrum after the Ca^^ 
addition dropped down slightly (compare with FIG. 4a). The decrease in blue and 

15 increase in green emission indicated that Ca^^ increased the efBciency of FRET from 
P4-3 to S65T, consistent with the expected decrease in distance between the two ends of 
the protein. The Ca^* response was fiilly reversible upon chelation of Ca^*. 

The Ca^*-specificity of the response of cameleon-1 was examined. Mg^^, pH, and ionic 
strength did not alter the emission spectra of either the Ca^^-saturated and 

20 Ca^^-unsaturated forms. The emission spectra of saturated and unsaturated cameleon-1 
were also not affected by hydrophobic proteins such as bovine serum albumin. Isolated 
CaM saturated with Ca^* typically becomes sticky with hydrophobic amino acids 
exposed to the surface. The CaM in cameleon-1, on the other hand, appears to interact 
preferentially with its intramolecularly-adjacent M13 peptide. The self-contained nature 

25 of the system minimizes the possibility that the protein might interact with endogenous 
CaM-binding sequences in eukarybtic cells. 
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The Ca^ binding behavior of cameleon-1 was examined. The CaM-M13 hybrid protein 
without GFPs displayed a biphasic Ca^* bmding with two dissociation constants (80 nM 
and 2 TM). See, Ponimb, T., ei al, ProtEngineering 7:109-115 (1994). Titration 
experiments revealed that the emission ratio of cameleon-1 has a biphasic Ca^^ 
dependency, as shown in FIGS. 4a and 4b. FIG. 4a shows the change in emission 
spectrum of cameleon-1 on titration with Ca^* when excited at 380 nm. The titration was 
done using KHHEDTA/KCaHEDTA solutions at pH 7.47. For clarity only two 
intermediate concentrations of Ca^^ are shown. 



FIG. 4b shows Ca^^ titration curves of cameleon-1 (open circles) and cameleon-l/E104Q 
10 (solid circles). Data points were from 4 independent experiments at different pH using 
Ca^^/EGTA and C2f /HEEDTA systems for each protein. In each experiment, the 
emission ratio (510/445 nm) change was normalized to the value of full saturation with 
Ca^^, which increased by 60-75% over the value of^^o Ca . The data of 
cameleon-l/E104Q were fitted to a four parameter logistic fiinction curve (dotted line). 
IS The data of cameleon-1 were analysed using a linear combination of 2 four parameter 
logistic function fits (solid line). The apparent dissociation constants (K'^s) for 
cameleon-1 were 68 nM and 11 TM, and the Hill coefficients were 1.8 and 1.0, 
respectively. Hie binding curve can be used to quantify the concentration of Ca^*^ present 
in the sample. Because of simulated negative cooperativity, cameleon-1 covers a very 
20 wide range of Ca^* concentration, from <10*^ to ~10"^ M. 

* 

The affinity of the CaM binding protein moiety can be modified. Many site-directed 
mutations have been studied for their effects on the Ca^^ binding and CiT -induced 
conformational changes of CaM. See, Maune, J.F., et aL,J.BioLChem, 267:5286-5295 
(1992), and Gao, Z.H., et al, J.BioLChem. 268:20096-20104 (1993). For example, a 
25 mutant chimera protein with the conserved bidentate glutamic acid at position 12 of the 
third Ca^^ binding loop of the CaM mutated to glutamine (cameleon-1 /El 04Q) was 
constructed. The mutation eliminated the high-affinity response of cameleon-1, as 
indicated in FIG. 4b, (solid circles). Cameleon-1 /El 04Q showed amonophasic response 
(K'd, 4.4 TM; Hill coefficient, 0.76), which conesponds closely to the low affinity 
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component of the cameleon-1 FRET response. Other modifications of CaM can be made 
to tune the Ca^"^ affinities for particular applications of the fluorescent indicator. 



HeLa cells were transfected with the recombinant plasmid (cameleon-l/pCDNA3) to 
determine whether cameieon-1 can work as a Ca^^ indicator in live cells. When the cells 

5 were excited with UV, however, the fluorescence of cameleon-1 was hardly detectable, 
mainly because of the dim fluorescence of the P4-3 component. Expression and folding 
of GFP at 37EC in mammalian ceUs was improved by introducing manmialian codon 
bias into the cDNA and mutating Phe64 to Leu, as in the commercially available 
construct "EGFP" (Clonetech), which encodes F64L/S65T v^th mammalian codons. 

10 The same changes were introduced into P4-3 (Y 66H/Y 1 45F), which did not change its 

fmal fluorescence properties but did improve expression in the HeLa cells. The 

* 

improved blue mutant ("EBFP") and EGFP substituted P4-3 and S65T, respectively, in 
cameleon-1 to make cameleon-2, shown in FIG. S, where Kz is Kozak*s consensus 
sequence (M. Kozak, J. Cell Biol. 108:229-241 (1989), The nucleotide sequence (SEQ 
15 ID N0:1) and amino acid sequence (SEQ ID N0:2) of cameleon-2 are shown in FIG. 7. 

The E104Q mutation v^ch afforded low Ca^^ afBnity in cameIeon-1 was also introduced 
into cameleon-2; the resulting chimera protein (cameleon-2/E104Q) cameleon-3, also 
shown in FIG. 5. The nucleotide sequence (SEQ ID N0:5) and amino acid sequence 
(SEQ ID N0:6) of cameleon-3 are shown in FIG. 9. 

20 Significant emission signals were observed &om both the EBFP and EGFP in the cells 
expressing cameleon-2 or cameleon-3. Referring to FIG. 6a, fluorescence of cameleon-2 
localized to the cytosol of HeLa cells. The fluorescence was uniformly distributed in the 
cytosolic compartment but excluded firom the nucleus, as expected for a protein of 74 
kDa Mdthout localization sequences or targeting signals. The image was taken using a 

25 330WB80 (excitation filter) and a 535DF45 (emission filter). The bar is 10 Tm. 

FIG. 6b shows time courses of the spatially averaged green:blue emission ratios fi-om two 
individual HeLa cells expressing cameleon-2. The two cells shown in FIG. 6a were 
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excited by UV (330WB80) and monitored every 15 seconds by digital imaging 
microscopy. See, for example, Tsien, R.Y. & Harootunian, A.T, Cell Calcium^ 1 1:93- 
109(1990). Hie emission bands (440DF40 and 535DF45) over the cytoplasmic regions 
were alternately sampled. In situ calibration-was performed for each of the cells. Hie 
5 pre-stimulation ratio (arrowhead) was assumed to be 50 nM, and (arrow) the value 
after saturation with Ca^"^. The calculated values for pCas 7, 6, 5, 4 are indicated by 
horizontal bars on the right side of the panel. 

Elevation of cytosolic Ca^^ concentration by saturating doses of ATP (as a purinergic 
agonist) and histamine produced significant increases in the emission ratio. Blockage 

10 of the histamine receptor by the antagonist cyproheptadine caused a rapid decrease in 
ratio, indicating the reversible behavior of the indicator. Addition of ionomycin followed 
by a high concentration (15 mM) of extracellular Ca^^ gave a large increase of the ratio 
(70-80% increase of the initial ratio value), which should correspond to the maximal ratio 
R^. Assuming the lowest ratio observed before stimulation represents th^jjl , 

IS calibration for free Ca^^ concentration can be performed. See, Adams, S.IL, ei al^ m 
Fluorescent and Luminescent Probes for Biological Activity (ed. Mason, W.T.) 
(Academic Press, 1993). 

By contrast, cameleon-3, which lacks the high affinity component of Ca^* binding to 
CaM, did not detect the changes in cytosolic Ca^ concentration signals due to ATP or 

20 histamine, but gave a similar in response to ionomycin and 20 mM extracellular 
Ca^*. Cameleon-3 is less sensitive to and buffers cytosolic Ci^ to a lesser extent than 
does caraeleon-2. The probable high Ca^"^ dissociation rate of cameleon-3 is 
advantageous for tracking rapid Ca^ release kinetics. The in vitro study revealed that 
the cameleon indicators show a relatively fast cellular response to Ca^^ concentration 

25 changes. 



Addition of a nuclear localization sequence to cameleon-2 yielded a Ca^* indicator, 
cameleon-2nu, shown in FIG. 5. The nucleotide sequence (SEQ ID N0:3) and amino 
acid sequence (SEQ ID N0:4) of cameleon-2nu are shown in FIG. 8. The fluorescence 
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of cameleon-2nu was localized to nuclei, as depicted in FIG. 6c. The time course of Ca^* 
concentrations in nuclei was followed (FIG. 6d) and was similar to the results obtained 
in the. cytosol ^IG. 6b). Agonist-induced changes in the free Ca^^ concentration inside 
the endoplasmic reticulum in mtact cells were similarly monitored. The low-afQnity 

5 indicator cameleon-3 was engineered to reside in the lumen of endoplasmic reticulum 
(ER) (cameleon-3er) (FIG. S) by addition of a localization sequence at the amino 
terminus and a KDEL signal for ER retention at the carboxy terminus of the fluorescent 
indicator. The nucleotide sequence (SEQ ID N0:7) and amino acid sequence of 
cameleon-3er (SEQ ID N0:8) are shown in FIG. 10. Reticular patterns of fluorescence 

10 were seen in HeLa cells expressing the protein. FIG. 6e is a digital fluorescence image 
of cameleon-3er in transfected HeLa cells. The image was obtained with a cooled CCD 
camera system with a 480DF30 (excitation filter) and a S3SDF4S (emission filter). The 
bar is 10 Tm. 



FIG. 6f shows a time course of the average Ca^*„ concentration of four cells obtained 
15 with a video-rate confocal microscope. Digital fluorescence images were the result of 
simultaneous acquisition of two confocal single-wavelength emission images at 450 nm 
(65 nm bandpass) and 535 nm (45 nm bandpass). After backgroimd subtraction, the ratio 
of the long wavelength image over the short wavelength one was calculated. Cells were 
illuminated for 66 msec (2 frames) for each time point. The interference filters and 
20 dichroics, and the sensitivity of the detectors ar6 different between the CCD and confocal 
microscope systems. Therefore, the ratios obtained in the two systems differ 
quantitatively. 

The pre-stimulus Ca^*„ concentration reported by cameleon-3er was consistently higher 
than cytosolic or nuclear Ca^^ concentrations reported by cameleon-2 and cameleon-2nu, 
25 respectively. Histamine reproducibly decreased the Ca^^^r concentration in all of 1 5 cells, 
whereas it always increased the cytosolic and nuclear Ca^^ concentrations. Receptor 
blockade by cyproheptadine reversed the decrease Ca^^^r concentration, indicating refill 
of the Ca^^ pools. In FIG. 6f, the ratio did not reach the value of the resting state, 
whereas complete reversion was observed in five other experiments. Addition of 
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ionomycin and 20 mM extracellular Ca^* increased Ca^*„ concentration to a saturating 
value above the starting level. Changes in Ca^*„ concentration were generally slower 
than those of the cytosol and nuclei. 



The targetability of the fluorescent indicators can permit Ca^'^ measurements at 
5 previously inaccessible sites such as the immediate vicinity of synaptic vesicles or Ca^* 
channels, and in genetically tractable organisms that do not load well with esters of 
small-molecule indicators. 



The Examples described below are illustrative of the disclosed method; however, many 
alternatives, modifications and variations will be clear to those skilled in the art 

10 Examples 

Gene construction 

The cDNA of the GFP mutant P4-3 was amplified by PGR with a sense primer 
containing a BamHl site, and a reverse pruner containing an SphI site and eliminating the 
GFP stop codon. See, for example, Heim, It & Tsien, R.Y. Current Biol 6:178-182 

15 (1996). Likewise, the cDNA of S65T was amplified with a Sacl site and an EcoSl site 
introduced to the S' and 3* ends of the gene, respectively. Two restriction sites {Sphl and 
Sacl) were introduced by PGR into 5* and 3' ends of the GaM-M13 gene, respectively, 
using the pHYl as a template. See, Porumb, T., et al ProL Engineering 7:109-115 
(1994). All the amplification reactions were done by Pfii polymerase (Stratagene). The 

20 restricted products were ligated and cloned in-fi:ame into the BamHUEco'Rl sites of 
pRSETB (Invitrogen). The modifications of the boundary regions between P4-3 and 
GaM and between Ml 3 and S65T were performed by PGR or by a combined use of 
restriction enzymes, Klenow fiagment of DNA polymerase I, T4 DNA polymerase, mung 
bean exonuclease, and T4 DNA ligase as described, for example, in Molecular Cloning 

IS A Laboratory Manual (eds. Sambrook, J., Fritsch, E.F. & Maniatis, T.) (GSH Laboratoiy 
Press, 1989). The phEGFP plasmid was commercially available Scorn Glontech, Two 
amino acid substitutions (Y66H and Y145F) were made in hEGFP to construct EBFP. 
Oligonucleotide-directed mutageneses were carried out using the Muta-Gene Phagemid 
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in vitro kit (Bio-Rad) at the codons for Y66H and Y 145F of EGFP, and for E104Q of the 
calmodulin. The 5' end of the EBFP gene was modified by PCR to have a ifmdin 
restriction site followed by a Kozak's consensus sequence (ACCGCC-ATG). The 
//indIII/£coRI fragment encoding the entire chimeric protein was subcloned in the 
5 manmialian expression vector pCDNA3 ^vitrogen). 

For cameleon-2nu, the cameleon-2 DNA was extended by PCR at the 3* end with the 
sequence encoding the nuclear localization sequence (PKKKRKVEDP). See» Forbes, 
DJ. Ann.Rev.Cell Biol 8:495-527 (1992), Cameleon-3er DNA was likewise obtained 
by extending the cameleon-3 DNA at the 5* end with the sequence encoding the 
10 localization sequence peptide from calreticulin (^IlJLPVPLLLGLLGLAAAD), and at 
the 3' end v^th the sequence encoding the ER retention signal (KDEL), See, Kendall, 
J.M. et al, Biochem. Biophys. Res. Commun, 189:1008-1016 (1992). 

Protein expression and spectroscopy 

The expression of chimera proteins in bacteria was performed using the T7 expresson 
1 5 system (pRSETB/JMl 09(DE3)). Cultures were grown at room temperature, and protein 
expression was induced by isopropyl B-D-thiogalactoside. Cells were lysed by a French 
press. The polyhistidine-tagged chimera proteins were purified from the cleared lysates 
on nickel-chelate columns (Qiagen). The protein samples in the eluates were 
concentrated by Centricon 30 (Amicon), and were further purified by gel-filtration 
20 column to remove abortive chimera proteins which resulted 6om proteolysis or 
misfolding. Emission spectra of the purified proteins were measured using a fluorometer 
(Spex Industries, Edison, NJ) at excitation 380 nm. 

Ca^^ titration and calibration 

The titration experiments were performed by the "pH-metric method" as described in 
25 Grzegorz, G., et al, J. Biol Chem. 260:3440-3450 (1985). In situ calibration for 
cytosolic Ca^^ concentration utilized the equation: 

[Ca^1c=K',((R-R„J/(R^-R)y»'"»> 
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where K*,, is the apparent dissociation constant corresponding to the Ca^* concentration 
at which R is midway between R^^^ and R„i„, and nH is the Hill coeflBcient 



Imaging 

Two to five days after the cDNA transfection with lipofectin (Gibco BRL), the cells were 
5 imaged on a Zeiss Axiovert microscope with a cooled CCD camera (Photometries, 
Tucson, AZ) interfaced to a personal computer. The program MetaFluor 2.75 (Universal 
Imaging) was used for controlling data acquisition and analysis. Dual-emission ratio 
imaging was carried out by manually switching the two emission filters (440DF40 for 
EBFP, 535DF45 for EGFP) in fix)nt of a single imaging camera. The excitation filter 
10 (330WB80) was used with a 420DRLP dichroic mirror. Digital fluorescence imaging 
with a video-rate confocal microscope was performed as described in Tsien, R.Y. & 
Backskai, BJ. Handbook of Biological Confocal Microscopy (ed. Pawley, J.B.) (Plenum 
Press, New York, 1995) p. 459-478. Cells were illuminated with wide band UV 
(351-364 nm) fi-om an Ar^ ion laser. The primary dichroic (380DRLP) reflects UV and 
15 transmits light emitted 6om the specimen, which is subsequently split by a secondaiy 
dichroic (505DRLP) into two broad bands: EBFP emission (450DF65) and EGFP 
emission (535DF45), and counted by photomultiplier tubes. 
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(111) NUMBER OF SEQUENCES; 8 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Richardson P.C. 

(B) STREET: 4225 Executive Square, Suite 1400 

(C) CITY: La Jolla 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 92037 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
IB) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1,30 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/818,252 

(B) FILING DATE: 14 March 1997 

(C) CLASSIFICATION: 
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SEQUENCE LISTING j 

(1) GENERAL INFORMATION: j 

(i) APPLICANT: The Regents of the University of California | 

(ii) TITLE OF INVENTION: Fluorescent Protein Sensors for \ 

Detection of Analyates ! 



(vii) PRIOR APPLICATION DATA: J 

(A) APPLICATION NUMBER US 0i9/818,2S3 \ 

(B) FILING DATE: 14 MARCH 1997 ] 

(C) CLASSIFICATION: i 



(vii) PRIOR APPLICATION DATA: i 

(A) APPLICATION NUMBER: US 08/919,143 

(B) FILING DATE: 27 AUGUST 1997 'i 

(C) CLASSIFICATION: \ 



(viii) ATTORNEY/ AGENT INFORMATION: 1 

(A) NAME: Haile, Lisa A. % 

(B) REGISTRATION NUMBER: 38,347 \ 

(C) REFERENCE/DOCKET NUMBER: 07257/058WO1 | 

(ix) TELECOMMUNICATION INFORMATION: | 

(A) TELEPHONE: 619/678-5070 | 

(B) TELEFAX: 619/678-5099 \ 

(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 1929 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 



(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: j 



ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 48 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu ^ 

5 10 15 I 



t 
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GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 ' 30 

GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 ^5 60 

CTG ACC CAT GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 240 
Leu Thr His Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 288 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 336 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 384 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

AAC TTC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 480 
Asn Phe Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

CCC GTG CTG CTG CCC GAC T^C CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

GTG ACC GCC GCC CGC ATG CAT GAC CAA CTG ACA GAA GAG CAG ATT GCA 720 
Val Thr Ala Ala Arg Met His Asp Gin Leu Thr Glu Glu Gin He Ala 
225 230 235 240 

GAG TTC AAA GAA GCC TTC TCA TTA TTC GAC AAG GAT GGG GAC GGC ACC 768 
Glu Phe Lys Glu Ala Phe Ser Leu Phe Asp Lys Asp Gly Asp Gly Thr 

245 250 255 

ATC ACC ACA AAG GAA CTT GGC ACC GTT ATG AGG TCG CTT GGA CAA AAC 816 
He Thr Thr Lys Glu Leu Gly Thr Val Met Arg Ser Leu Gly Gin Asn 

260 265 270 
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CCA ACG GAA GCA GAA TTG CAG GAT ATG ATC AAT GAA GTC GAT GCT GAT 864 
Pro Thr Glu Ala Glu Leu Gin Asp Met lie Asn Glu Val Asp Ala Asp 
275 260 285 

GGC AAT GGA ACG ATT TAG TTT CCT GAA TTT CTT ACT ATG ATG GCT AGA 912 
Gly Asn Gly Thr lie Tyr Phe Pro Glu Phe Leu Thr Met Met Ala Arg 
290 295 300 

AAA ATG AAG GAC ACA GAC AGC GAA GAG GAA ATC CGA GAA GCA TTC CGT 960 
Lys Met Lys Asp Thr Asp Ser Glu Glu Glu lie Arg Glu Ala Phe Arg 
305 310 315 320 

GTT TTT GAC AAG GAT GGG AAC GGC TAC ATC AGC GCT GCT GAA TTA CGT 1008 
Val Phe Asp Lys Asp Gly Asn Gly Tyr lie Ser Ala Ala Glu Leu Arg 

325 330 335 

CAC GTC ATG ACA AAC CTC GGG GAG AAG TTA ACA GAT GAA GAA GTT GAT 1056 
His Val Met Thr Asn Leu Gly Glu Lya Leu Thr Asp Glu Glu Val Asp 

340 345 350 

GAA ATG ATA AGG GAA GCA GAT ATC GAT 6GT GAT . GGC CAA GTA AAC TAT 1104 
Glu Met lie Arg Glu Ala Asp He Asp Gly Asp Gly Gin Val Asn Tyr 
355 360 365 

GAA GAG TTT GTA CAA ATG ATG ACA GCA AAG GGG GGG TVAG AGG CGC TGG 1152 
Glu Glu Phe Val Gin Met Met Thr Ala Lys Gly Gly Lys Arg Arg Trp 
370 375 380 

AAG AAA AAC TTC ATT GCC GTC AGC GCT GCC AAC CGG TTC AAG AAG ATC 1200 
Lys Lys Asn Phe He Ala Val Ser Ala Ala Asn Arg Phe Lys Lys He 
385 390 395 400 

TCC GAG CTC ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG 1248 
Ser Glu Leu Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val 

405 410 415 

CCC ATC CTG GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC 1296 
Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser 

420 425 430 

GTG TCC GGC GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG 1344 
Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu 
435 440 445 

AAG TTC ATC TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC 13^2 
Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu 
450 455 460 

GTG ACC ACC CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC 1440 
Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp 
465 470 475 480 

CAC ATG AAG CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC 1488 
His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr 

485 490 495 

GTC CAG GAG CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC 1536 
Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr 

500 505 510 

CGC GCC GAG GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG 1584 
Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn . Arg He Glu 
515 520 525 
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CTG AAG GGC ATC GAC TTC AAG GAG GAG GGC AAC ATC CTG GGG CAC AAG 1632 
Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys 
530 535 540 

CTG GAG TAG AAC TAG AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG 1680 
Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys 
545 550 555 560 

CAG AAG AAC GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG 1728 
Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu 

565 570 575 

GAC GGC AGC GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC 1776 
Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He 

580 585 590 

GGC GAC GGC CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG 1824 
Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin 
595 600 605 

TCC GCC CTG AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG 1872 
Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu 
610 615 620 

CTG GAG TTC GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG 1920 
Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu 
625 630 635 640 

.TAG AAG TAA 1929 
Tyr Lys 



(3) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQX7ENCE CHARACTERISTICS 

(A) LENGTH: 642 AMINO ACIDS 

(B) TYPE: AMINO ACID 

(C) STRANDEDNESS ; 

(D) TOPOLOGY: LINEAR 

(il) MOLECULE TYPE: PROTEIN 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



Met 


Val 


Ser 


Lys 


Gly 


Glu 


Glu 


lieu 


Phe 


Thr 


Gly Val Val 


Pro 


He Leu 










5 










■ 10 






15 


Val 


Glu 


Leu 


Asp 


Gly 


Asp 


Val 


Asn 


Gly 


His 


Lys Phe Ser 


Val 


Ser Gly 








20 










25 






30 


Glu Gly 


Glu 


Gly 


Asp 


Ala 


Thr 


Tyr 


Gly 


Lys 


Leu Thr Leu 


Lys 


Phe He 






35 










40 






45 




Cys 


Thr 


Thr 


Gly 


Lys 


Leu 


Pro 


Val 


Pro 


Trp 


Pro Thr Leu 


Val 


Thr Thr 




50 










55 








60 






Leu 


Thr 


His 


Gly 


Val 


Gin 


Cys 


Phe 


Ser 


Arg 


Tyr Pro Asp His Met Lys 


65 










70 










75 




80 


Gin 


His 


Asp 


Phe 


Phe 


Lys 


Ser 


Ala 


Met 


Pro 


Glu Gly Tyr Val Gin Glu 










85 










90 






95 


Arg 


Thr 


He 


Phe 


Phe 


Lys 


Asp 


Asp 


Gly 


Asn 


Tyr Lys Thr 


Arg 


Ala Glu 








100 










105 






110 




Val 


Lys 


Phe 


Glu 


Gly 


Asp 


Thr 


Leu 


Val 


Asn 


Arg He Glu 


Leu 


Lys Gly 






115 










120 






125 
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He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Phe Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Arg Met His Asp Gin Leu Thr Glu Glu Gin He Ala 
225 230 235 240 

Glu Phe Lys Glu Ala Phe Ser Leu Phe Asp Lys Asp Gly Asp Gly Thr 

245 250 255 

He Thr Thr Lys Glu Leu Gly Thr Val Met Arg Ser Leu Gly Gin Asn 

260 265 270 

Pro Thr Glu Ala Glu Leu Gin Asp Met lie Asn Glu Val Asp Ala Asp 
275 280 285 

Gly Asn Gly Thr He Tyr Phe Pro Glu Phe Leu Thr Met Met Ala Arg 
290 295 300 

Lys Met Lys Asp Thr Asp Ser Glu Glu Glu He Arg Glu Ala Phe Arg 
305 310 315 320 

Val Phe Asp Lys Asp Gly Asn Gly Tyr He Ser Ala Ala Glu Leu Arg 

325 330 335 

His Val Met Thr Asn Leu Gly Glu Lys Leu Thr Asp Glu Glu Val Asp 

340 345 350 

Glu Met He Arg Glu Ala Asp He Asp Gly Asp Gly Gin Val Asn Tyr 
355 360 365 

Glu Glu Phe Val Gin Met Met Thr Ala Lys Gly Gly Lys Arg Arg Trp 
370 375 380 

Lys Lys Asn Phe He Ala Val Ser Ala Ala Asn Arg Phe Lys Lys He 
385 390 395 400 

Ser Glu Leu Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val 

405 410 415 

Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe- Ser 

420 425 ' 430 

Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu 
435 440 445 

Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu 
450 455 460 

Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp 
465 470 475 480 
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His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr 

485 490 495 

Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr 

500 505 510 

Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu 
515 520 525 

Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys 
530 ,535 540 

Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys 
545 550 555 560 

Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu 

565 570 575 

Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He 

580 585 590 

Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin 
595 600 605 

Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu 
610 615 620 

Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu 
625 630 635 640 

Tyr Lys 



(4) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 1959 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 



GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 
144 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 



1 
t 

i 



■* 



4 

r« 

■i 



ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 48 ' 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

5 10 15 i 



GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly i 

20 25 30 ' • ^ 
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CTG ACC CAT GGC GTG CAG TGC TTC AGC CGC TAG CCC GAC CAC ATG AAG 240 
Leu Thr His Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 288 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 



CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 



336 



GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 



384 



ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 
lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 



432 



AAC TTC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 460 
Asn Phe Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 528 
Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 

165 170 175 



GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 

180 185 190 



576 



CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 



624 



AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 



672 



GTG ACC GCC GCC CGC ATG CAT GAC CAA CTG ACA GAA GAG CAG ATT GCA 720 
Val Thr Ala Ala Arg Met His Asp Gin Leu Thr Glu Glu Gin lie Ala 
225 230 235 240 

ft 

GAG TTC AAA GAA GCC TTC TCA TTA TTC GAC AAG GAT GGG GAC GGC ACC 768. 
Glu Phe Lys Glu Ala Phe Ser Leu Phe Asp Lys Asp Gly Asp Gly Thr 

245 250 255 



ATC ACC ACA AAG GAA CTT GGC ACC GTT ATG AGG TCG CTT GGA CAA AAC 816 
He Thr Thr Lys Glu Leu Gly Thr Val Met Arg Ser Leu Gly Gin Asn 

260 265 270 

CCA ACG GAA GCA GAA TTG CAG GAT ATG ATC AAT GAA GTC GAT GCT GAT 864 
Pro Thr Glu Ala Glu Leu Gin Asp Met He Asn Glu Val Asp Ala Asp 
275 280 285 

GGC AAT GGA ACG ATT TAC TTT CCT GAA TTT CTT ACT ATG ATG GCT AGA 912 
Gly Asn Gly Thr He Tyr Phe Pro Glu Phe Leu Thr Met Met Ala Arg 
290 295 300 



AAA ATG AAG GAC ACA GAC AGC GAA GAG GAA ATC CGA GAA GCA TTC CGT 
Lys Met Lys Asp Thr Asp Ser Glu Glu Glu He Arg Glu Ala Phe Arg 
305 310 315 320 



960 
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GTT TTT GAC AAG GAT GGG AAC GGC TAG ATC AGC GCT GCT GAA TTA CGT 1008 
Val Phe Asp Lys Asp Gly Asn Gly Tyr He Ser Ala Ala Glu Leu Arg 

325 330 335 

CAC GTC ATG ACA AAC CTC GGG GAG AAG TTA ACA GAT GAA GAA GTT GAT 1056 
His Val Met Thr Asn Leu Gly Glu Lys Leu Thr Asp Glu Glu Val Asp 

340 345 350 

GAA ATG ATA AGG GAA GCA GAT ATC GAT GGT GAT GGC CAA GTA AAC TAT 1104 
Glu Met He Arg Glu Ala Asp He Asp Gly Asp Gly Gin Val Asn Tyr 
355 360 365 

GAA GAG TTT GTA CAA ATG ATG ACA GCA AAG GGG GGG AAG AGG CGC TGG 1152 
Glu Glu Phe Val Gin Met Met Thr Ala Lys Gly Gly Lys Arg Arg Trp 
370 375 380 



AAG AAA AAC TTC ATT GCC GTC AGC GCT GCC AAC CGG TTC AAG AAG ATC 1200 
Lys Lys Asn Phe He Ala Val Ser Ala Ala Asn Arg Phe Lys Lys He 
385 390 395 400 

TCC GAG CTC ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG 1248 
Ser Glu Leu Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val 

405 410 415 

CCC ATC CTG GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC 1296 
Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser 

420 425 : 430 

GTG TCC GGC GAG GGC GAG GGC GAT GCC ACC TAG GGC AAG CTG ACC CTG 1344 
Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu 
435 440 445 

AAG TTC ATC TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC 1392 
Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu 
450 455 460 

GTG ACC ACC CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAG CCC GAC 1440 
Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp 
465 470 475 480 



CAC ATG AAG CAG CAC GAC TTC TTC AAG 
His Met Lys Gin His Asp Phe Phe Lys 

485 



TCC GCC ATG CCC GAA GGC TAC 1488 
Ser Ala Met Pro Glu Gly Tyr 
490 495 



GTC CAG GAG CGC ACC 
Val Gin Glu Arg Thr 

500 


ATC 
He 


TTC 

Phe 


TTC 
Phe 


AAG 
Lys 
505 


GAC 
Asp 


GAC GGC AAC 
Asp Gly Asn 


TAC 
Tyr 
510 


AAG 
Lys 


ACC 
Thr 


153.6 


CGC GCC GAG GTG AAG 
Arg Ala Glu Val Lys 
515 


TTC GAG GGC 
Phe Glu Gly 
520 


GAC 
Asp 


ACC 
Thr 


CTG 

Leu 


GTG 
Val 


AAC 
Asn 
525 


CGC 
Arg 


ATC 

He 


GAG 
Glu 


1584 


CTG AAG GGC ATC GAC 
Leu Lys Gly He Asp 
530 


TTC 
Phe 


AAG 
Lys 
535 


GAG 
Glu 


GAC 
Asp 


GGC 
Gly 


AAC 
Asn 


ATC 
He 
540 


CTG 
Leu 


GGG 
Gly 


CAC 
His 


AAG 
Lys 


1632 


CTG GAG TAC AAC TAC 
Leu Glu Tyr Asn Tyr 
545 


AAC 
Asn 
550 


AGC 
Ser 


CAC 
His 


AAC 
Asn 


GTC 
Val 


TAT 
Tyr 
555 


ATC 
He 


ATG 
Met 


GCC 
Ala 


GAC 
Asp 


AAG 
Lys 
560 


1680 


CAG AAG AAC GGC ATC 
Gin Lys Asn Gly He 

565 


AAG 
Lys 


GTG 
Val 


AAC 
Asn 


TTC 
Phe 


AAG 
Lys 
570 


ATC CGC CAC 
He Arg His 


AAC 
.Asn 


ATC 
He 
575 


GAG 
Glu 


1728 
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GAC GGC AGC GTG CAG CTC GCC GAC CAC TAG GAG GAG AAC AGG GGG ATG 
Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie 

580 585 590 



1776 



GGC GAC GGC CCC GTG CTG CTG CCC GAC AAC GAC TAG GTG AGG AGC CAG 1824 
Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin 
595 600 605 

TCC GCC CTG AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTG CTG 1872 
Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu 
610 615 620 



CTG GAG TTC GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG 
Leu Glu Phe Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu 
625 630 635 640 



1920 



TAG AAG CCA AAA AAG AAG AGA AAG GTG GAA GAC GGT TAA 
1959 

Tyr Lys Pro Lys Lys Lys Arg Lys Val Glu Asp Ala. 

645 650 



(5) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 652 AMINO ACIDS 

(B) TYPE: AMINO ACID 

(C) STRANDEDNESS : 

(D) TOPOLOGY: LINEAR 



(ii) MOLECULE TYPE: PROTEIN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 

5 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala ^hr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 



■J 
t 



3 



Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Thr His Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 



Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 



Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 



lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 



Asn Phe Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 



■ 



4 

* 

1 
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Gly He Lys Val Asn Phe Lya He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220. 

Val Thr Ala Ala Arg Met His Asp Gin Leu Thr Glu Glu Gin He Ala 
225 230 235 ' 240 

Glu Phe Lys Glu Ala Phe Ser Leu Phe Asp Lys Asp Gly Asp Gly Thr 

245 250 255 

He Thr Thr Lys Glu Leu Gly Thr Val Met Arg Ser Leu Gly Gin Asn 

260 265 270 

•Pro Thr Glu Ala Glu Leu Gin Asp Met He Asn Glu Val Asp Ala Asp 
275 280 285 

Gly Asn Gly Thr He Tyr Phe Pro Glu Phe Leu Thr Met Met Ala Arg 
290 295 300 

Lys Met Lys Asp Thr Asp Ser Glu Glu Glu He Arg Glu Ala Phe Arg 
305 310 315 320 

Val Phe Asp Lys Asp Gly Asn Gly Tyr He Ser Ala Ala Glu Leu Arg 

325 330 335 

His Val Met Thr Asn Leu Gly Glu Lya Leu Thr Asp Glu Glu Val Asp 

340 345 350 

Glu Met He Arg Glu Ala Asp He Asp Gly Asp Gly Gin Val Asn Tyr 
355 360 365 

Glu Glu Phe Val Gin Met Met Thr Ala Lys Gly Gly Lys Arg Arg Trp 
370 375 • 380 

Lys Lys Asn Phe He Ala Val Ser Ala Ala Asn Arg Phe Lys Lys He 
385 390 395 400 

Ser Glu Leu Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val 

405 410 415 

Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser 

420 425 430 

Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu 
435 440 445 

Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu 
450 455 460 

Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp 
465 470 475 480 

His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr 

485 490 495 
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Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr 

500 505 510 



GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Sex Gly 

20 25 30 

« 

GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 



Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu * 
515 520 525 i 

Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys ! 
530 535 540 j 

Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys ' 
545 550 555 560 

• ■ > 

« 

Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu ■ 

565 570 575 

Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He 

580 585 590 

Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin \ 
595 600 605 t 

■ 

{ 

Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu 

610 615 620 7- 

Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu 
625 630 635 640 

« 

Tyr Lys Pro Lys Lys Lys Arg Lys Val Glu Asp Ala 

645 650 

(6) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 1929 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: CDNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 48 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu ( 

5 10 15 { 
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CTG ACC CAT GGC GTG GAG TGC TTC AGC CGC TAG CCC GAC CAC ATG AAG 240 
Leu Thr His Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 288 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 336 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 384 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

AAC TTC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 480 
Asn Phe Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

GTG ACC GCC GCC CGC ATG CAT GAC CAA CTG ACA GAA GAG CAG ATT GCA 720 
Val Thr Ala Ala Arg Met His Asp Gin Leu Thr Glu Glu Gin He Ala 
225 230 235 240 

GAG TTC AAA GAA GCC TTC TCA TTA TTC GAC AAG GAT GGG GAC GGC ACC 768 
Glu Phe Lys Glu Ala Phe Ser Leu Phe Asp Lys Asp Gly Asp Gly Thr 

245 250 255 

ATC ACC ACA AAG GAA CTT GGC ACC GTT ATG AGG TCG CTT GGA CAA AAC 816 
He Thr Thr Lys Glu Leu Gly Thr Val Met Arg Ser Leu Gly Gin Asn 

260 265 270 

CCA ACG GAA GCA GAA TTG CAG GAT ATG ATC AAT GAA GTC GAT GCT GAT 864 
Pro Thr Glu Ala Glu Leu Gin Asp Met He Asn Glu Val Asp Ala Asp 
275 280 285 

GGC AAT GGA ACG ATT TAC TTT CCT GAA TTT CTT ACT ATG ATG GCT AGA 912 
Gly Asn Gly Thr He Tyr Phe Pro Glu Phe Leu Thr Met Met Ala Arg 
290 295 300 

AAA ATG AAG GAC ACA GAC AGC GAA GAG GAA ATC CGA GAA GCA TTC CGT 960 
Lys Met Lys Asp Thr Asp Ser Glu Glu Glu He Arg Glu Ala Phe Arg 
305 310 315 320 
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GTT TTT GAC AAG GAT GGG AAC GGC TAG ATC AGC GCT GOT GAG TTA CGT 1008 
Val Phe Asp Lys Asp Gly Asn Gly Tyr lie Ser Ala Ala Gin Leu Arg 

325 330 335 

GAC GTC ATG ACA AAC CTC GGG GAG AAG TTA ACA GAT GAA GAA GTT GAT 1056 

His Val Met Thr Asn Leu Gly Glu Lys Leu Thr Asp Glu Glu Val Asp 

340 345 350 

GAA ATG ATA AGG GAA GCA GAT ATC GAT GGT GAT GGC CAA GTA AAC TAT 1104 
Glu Met lie Arg Glu Ala Asp lie Asp Gly Asp Gly Gin Val Asn Tyr 
355 360 365 

GAA GAG TTT GTA CAA ATG ATG ACA GCA AAG GGG GGG AAG AGG CGC TGG 1152 
Glu Glu Phe Val Gin Met Met Thr Ala Lys Gly Gly Lys Arg Arg Trp 
370 375 380 

AAG AAA AAC TTC ATT GCC GTC AGC GCT GCC AAC CGG TTC AAG AAG ATC 1200 
Lys Lys Asn Phe lie Ala Val Ser Ala Ala Asn Arg Phe Lys Lys lie 
385 390 395 400 

TCC GAG CTC ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG 1248 
Ser Glu Leu Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val 

405 410 415 

CCC ATC CTG GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC 1296 
Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser 

420 425 430 

GTG TCC GGC GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG 1344 
Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu 
435 440 445 

AAG TTC ATC TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC 1392 
Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu 
450 455 460 

GTG ACC ACC CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC 1440 
Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp 
465 470 475 480 

CAC ATG AAG CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC 1488 
His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr 

• 485 490 495 

GTC CAG GAG CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC 1536 
Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr 

500 505 510 

CGC GCC GAG GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG 1584 
Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu 
515 520 525 

CTG AAG GGC ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG 1632 
Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys 
530 535 540 

CTG GAG TAC AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG 1680 
Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys 
545 550 555 560 

CAG AAG AAC GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG 1728 
Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu 

565 570 575 
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GAC GGC AGC GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC 
Asp Gly Sex Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie 

580 585 590 



1776 



GGC GAC GGC CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG 
Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin 
595 600 605 



1824 



TCC GCC CTG AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG 
Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu 
610 615 620 



1872 



CTG GAG TTC GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG 
Leu Glu Phe Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu 
625 630 635 ' 640 



1920 



TAC AAG TAA 

1929 
Tyr Lys 



(7) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 642 AMINO ACIDS 

(B) TYPE: AMINO ACID 

(C) STRANDEDNESS : 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 

5 10 . 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 



Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Thr His Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 



Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 



Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 



He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Phe Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 

145 150 155 160 



Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 
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Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Arg Met His Asp Gin Leu Thr Glu Glu Gin lie Ala 
225 230 235 240 

Glu Phe Lys Glu Ala Phe Ser Leu Phe Asp Lys Asp Gly Asp Gly Thr 

245 250 255 

lie Thr Thr Lys Glu Leu Gly Thr Val Met Arg Ser Leu Gly Gin Asn 

260 265 270 

Pro Thr Glu Ala Glu Leu Gin Asp Met lie Asn Glu Val Asp Ala Asp 
275 280 285 

Gly Asn Gly Thr lie Tyr Phe Pro Glu Phe Leu Thr Met Met Ala Arg 
290 295 300 

Lys Met Lys Asp Thr Asp Ser Glu Glu Glu lie Arg Glu Ala Phe Arg 
305 310 315 320 

Val Phe Asp Lys Asp Gly Asn Gly Tyr He Ser Ala Ala Gin Leu Arg 

325 330 335 

His Val Met Thr Asn Leu Gly Glu Lys Leu Thr Asp Glu Glu Val Asp 

340 345 350 

Glu Met He Arg Glu Ala Asp lie Asp. Gly Asp Gly Gin Val Asn Tyr 
355 360 365 

Glu Glu Phe Val Gin Met Met Thr Ala Lys Gly Gly Lys Arg Arg Trp 
370 375 380 

Lys Lys Asn Phe He Ala Val Ser Ala Ala Asn Arg Phe Lys Lys He 
385 390 395 400 

Ser Glu Leu Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val 

405 410 415 

Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser 

420 425 430 

Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu 
435 440 445 

Lys Phe He Cys Thr Thr Gly Lys Leu. Pro Val Pro Trp Pro Thr Leu 
450 455 460 

Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp 
465 470 475 480 

His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr 

485 490 495 

Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr 

500 505 510 

Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu 
515 520 525 
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Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys 
530 535 540 

Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys 
545 550 555 560 

Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu 

565 570 575 

Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie 

580 585 590 

Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin 
595 600 605 

Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu 
610 615 620 

Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu 
625 630 635 640 

Tyr Lys 

(8) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE C3IARACTERISTICS 

(A) LENGTH: 1971 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ATG CTG CTG CCC GTC CCC CTG CTG CTG GGC CTG CTG GGC GCC GCC GCC 48 
Met Leu Leu Pro Val Pro Leu Leu Leu Gly Leu Leu Gly Ala Ala Ala 

5 10 15 

GAC GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 96 
Asp Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

20 25 30 

GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 144 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
35 40 45 

.GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 192 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
50 55 60 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 240 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
65 70 75 80 

CTG ACC CAT GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 288 
Leu Thr His Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

85 90 95 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 336 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

100 105 110 
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CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 384 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
115 120 125 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 432 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
130 135 140 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 480 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
145 150 155 160 

AAC TTC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 528 
Asn Phe Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 

165 170 175 

GGC ATC MiG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 576 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

180 185 190 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 624 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 
195 200 205 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 672 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
210 215 220 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 720 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
225 230 235 240 

GTG ACC GCC GCC CGC ATG CAT GAC CAA CTG ACA GAA GAG CAG ATT GCA 768 
Val Thr Ala Ala Arg Met His Asp Gin Leu Thr Glu Glu Gin He Ala 

245 250 255 

GAG TTC AAA GAA GCC TTC TCA TTA TTC GAC AAG GAT GGG GAC GGC ACC 816 
Glu Phe Lys Glu Ala Phe Ser Leu Phe Asp Lys Asp Gly Asp Gly Thr 

260 265 270 

ATC ACC ACA AAG GAA CTT GGC ACC GTT ATG AGG TCG CTT GGA CAA AAC 864 
He Thr Thr Lys Glu Leu Gly Thr Val Met Arg Ser Leu Gly Gin Asn 
275 280 285 

CCA ACG GAA GCA GAA TTG CAG GAT ATG ATC AAT GAA GTC GAT GCT GAT 912. 
Pro Thr Glu Ala Glu Leu Gin Asp Met He Asn Glu Val Asp Ala Asp 
290 295 300 

GGC AAT GGA ACG ATT TAC TTT CCT GAA TTT CTT ACT ATG ATG GCT AGA 960 
Gly Asn Gly Thr He Tyr Phe Pro Glu Phe Leu Thr Met Met Ala Arg 
305 310 315 320 

AAA ATG AAG GAC ACA GAC AGC GAA GAG GAA ATC CGA GAA GCA TTC CGT 1008 
Lys Met Lys Asp Thr Asp Ser Glu Glu Glu He Arg Glu Ala Phe Arg 

325 330 335 

GTT TTT GAC AAG GAT GGG AAC GGC TAC ATC AGC GCT GCT CAG TTA CGT 1056 
Val Phe Asp Lys Asp Gly Asn Gly Tyr He Ser Ala Ala Gin Leu Arg 

340 345 350 



CAC GTC ATG ACA AAC CTC GGG GAG AAG TTA ACA GAT G7^ GAA GTT GAT 1104 
His Val Met Thr Asn Leu Gly Glu Lys Leu Thr Asp Glu Glu Val Asp 
355 360 365 
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GAC GGC AGC GTG CAG CTC GCC GAC CAC TAG CAG CAG AAC ACC CCC ATC 1824 
Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He 



GGC GAC GGC CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG 1872 
Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin 
610 615 620 



- 64 . 

GAA ATG ATA AGG GAA GCA GAT ATC GAT GGT GAT GGC CAA GTA AAd TAT 1152 
Glu Met He Arg Glu Ala Asp He Asp Gly Asp Gly Gin Val Asn Tyr 
370 375 380 

GAA GAG TTT GTA CAA ATG ATG ACA GCA AAG GGG GGG AAG AGG CGC TGG 1200 
Glu Glu Phe Val Gin Met Met Thr Ala Lys Gly Gly Lys Arg Arg Trp 

385 390 395 400 S 

AAG AAA AAC TTC ATT GCC GTC AGC GCT GCC AAC CGG TTC AAG AAG ATC 1248 1 
Lys Lys Asn Phe He Ala Val Ser Ala Ala Asn Arg Phe Lys Lys He i 

405 410 415 i 



i 



1 



TCC GAG CTC ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG 1296 ! 
Ser Glu Leu Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val 

420 425 430 



/ 



CCC ATC CTG GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC 1344 
Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser 
435 440 445 

GTG TCC GGC GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG 1392 
Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu 

450 455 460 ~- 

AAG TTC ATC TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC 1440 
Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu 
465 470 475 480 

GTG ACC ACC CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC 1488 
Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp 

485 490 495 

CAC ATG AAG CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC 1536 
His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr 

500 505 510 

GTC CAG GAG CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC 1584 
Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr 
515 520 525 

t 

CGC GCC GAG GTG AAG TTC GAG GGC . GAC ACC CTG GTG AAC CGC ATC GAG 1632 \ 
Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu i 
530 535 540 j 

i 

CTG AAG GGC ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG 1680 | 
Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys } 
545 550 555 560 I 



1 



CTG GAG TAC AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG 1728 \ 
Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys 

565 570 575 

CAG AAG AAC GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG 1776 
Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu 

580 585 590 



595 600 605 ! 
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TCC GCC CTG AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG 1920 
Ser Ala Leu Ser Lys Asp Pro Asn Glii Lys Arg Asp His Met Val Leu 
625 630 635 640 

CTG GAG TTC GTG ACC GCC GCC GGG ATC ACT CTC GGC AAG GAC GAG CTO 1968 
Leu Glu Phe Val Thr Ala Ala Gly lie Thr Leu Gly Lys Asp Glu Leu 

645 650 655 

TAA X971 
(9) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 656 AMINO ACIDS 

(B) TYPE: AMINO ACID 

(C) STRANDEDNESS : 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Leu Leu Pro Val Pro Leu Leu Leu Gly Leu Leu Gly Ala Ala Ala 

5 10 15 

Asp Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 

20 25 30 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
35 40 45 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
50 55 60 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
65 70 ,75 80 

Leu Thr His. Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

85 90 95 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

100 105 110 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
115 120 125 , 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
130 135 140 

lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
145 150 155 160 

Asn Phe Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 

165 170 175 

Gly lie Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

180 185 190 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
195 200 205 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
210 215 220 
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Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
225 230 235 240 

Val Thr Ala Ala Arg Met His Asp Gin Leu Thr Glu Glu Gin He Ala 

245 250 255 

Glu Phe Lys Glu Ala Phe Ser Leu Phe Asp Lys Asp Gly Asp Gly Thr 

260 265 270 

He Thr Thr Lys Glu Leu Gly Thr Val Met Arg Ser Leu Gly Gin Asn 
275 280 285 

Pro Thr Glu Ala Glu Leu Gin Asp Met He Asn Glu Val Asp Ala Asd 
290 295 300 

Gly Asn Gly Thr He Tyr Phe Pro Glu Phe Leu Thr Met Met Ala Arg 
305 310 315 . 320 

Lys Met Lys Asp Thr Asp Ser Glu Glu Glu He Arg Glu Ala Phe Arg 

325 330 335 

Val Phe Asp Lys Asp Gly Asn Gly Tyr He Ser Ala Ala Gin Leu Arg 

340 345 350 



His Val Met Thr Asn Leu Gly Glu Lys Leu Thr Asp Glu Glu Val Asp 

355 360 365 

Glu Met He Arg Glu Ala Asp He Asp Gly Asp Gly Gin Val Asn Tyr 
370 375 380 

Glu Glu Phe Val Glii Met Met Thr Ala Lys Gly Gly Lys Arg Arg Trp 
385 390 395 400 

Lys Lys Asn Phe He Ala Val Ser Ala Ala Asn Arg Phe Lys Lys lie 

405 410 415 

Ser Glu Leu Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val 

420 425 430 

Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser 

435 440 445 



Val Ser Gly Glu Gly Glu 
450 

Lys Phe He Cys Thr Thr 
465 470 

Val Thr Thr Leu Thr Tyr 

485 

His Met Lys Gin His Asp 

500 

Val Gin Glu Arg Thr He 
515 

Arg Ala Glu Val Lys Phe 
530 



Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu 
455 460 

Gly Lys Leu Pro Val Pro Trp Pro Thr Leu 

475 480 

Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp 

490 495 

Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr 
505 510 

Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr 
520 525 

Glu Gly Asp Thr Leu Val Asn Arg He Glu 
535 540 



Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys 
545 550 555 560 
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Leu Glu. Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys 

565 570 575 

Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu 

580 585 590 

Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He 
595 600 605 

Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin 
610 615 620 

Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu 
625 630 635 640 

Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Lys Asp Glu Leu 

645 650 655 
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From the above description, the essential characteristics of the present invention can be 
ascertained, ^thout departing ftom the spirit and scope thereof, various changes and 
modifications of the invention can be made to adapt it to various usages and conditions. 
Thus, other embodiments are also within the claims. 
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What is claimed is: 

1 . A nucleic acid molecule, comprising a polynucleotide that encodes a fluorescent 
protein energy transfer partner, comprising: 

a first chemical entity binding region that binds to a chemical entity and binds 
to a second protein partner, and 

a fluorescent protein in frame with said first chemical entity binding region; 
wherein said fluorescent protein is a first energy transfer partner for a second 
energy transfer partner attached to said second protein partner. 

2. The nucleic acid molecule of claim 1, wherein said fluorescent protein is an 
Aequorea-TGlattd fluorescent protein. 

3. The nucleic acid molecule of claim 1, wherein said fu*st chemical entity bindmg 
region comprises a bound conformation of smaller volume when said chemical 
entity is bound to said first chemical entity bmding region compared to said first 
chemical entity binding region when said chemical entity is not bound to said first 
chemical entity binding region and said bound conformation permits increased 
energy transfer between said first energy transfer partner and said second energy 
transfer partner. 

4. The nucleic acid molecule of claim 3, wherein said bound conformation increases 
the afiBnity between said first chemical entity binding region and said second 
protein partner. 

5. The nucleic acid molecule of claim 2, wherein said first chemical entity binding 
region binds to a protein association region of said second protein partner. 

6. The nucleic acid molecule of claim 3, wherein said chemical entity is an analyte. 
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7. The nucleic acid molecule of claim 2, wherein said first chemical entity binding 
region is either calmodulin, a cahnodulin-related protein moiety, cGMP-dependent 
protein kinase, a steroid hormone receptor, a ligand binding domain of a steroid 
hormone receptor, protein kinase C, mositol-l,4,5-triphosphate receptor, or 
recoverin. 



8. The nucleic acid molecule of claim 3, wherein said first chemical entity binding 
region is calmodulin or a calmodulin-related protein. 

9. The nucleic acid molecule of claim 1, wherein said fluorescent protein energy 
transfer partner further comprises a second energy transfer partner in frame with 
said first energy transfer partner. 



1 0. The nucleic acid molecule of claim 9, wherein said first energy transfer partner is 
either P4-3, EBFP, or WIB, and said second energy transfer partner is either S65T, 
EGFP, or 10c. 

11. The nucleic acid molecule of claim 1, wherein said fluorescent protein energy 
transfer partner further comprises a second protein partner comprises said second 
energy transfer partner in fiame with said fust energy transfer partner. 

1 2. The nucleic acid molecule of claim 1 1 , wherein said second protein partner further 
comprises a protem association region. 

13. The nucleic acid molecule of claim 11, wherein said second protem partner is 
either a calmodulin-binding domain of skMLCKp, smMLCK, CaMKII, 
Caldesmon, Calspermin, phosphofiiictokinase calcineurin, phosphorylase kinase, 
Ca2+-ATPase 59 kDa PDE, 60 kDa PDE, nitric oxide synthase, type I adenylyl 
cylclase, Bordetella pertussis adenylyl cyclase, Neuromodulin, Spectrin, 
MARCKS, F52, JJ-Adducin, HSP90a, HIV-1 gpl60, BBMHBI, Dilute MHC, M- 
astoparan, Melittin, Glucagon, Secretin, VIP, GIP, or Model Peptide CBP2. 
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14. The nucleic acid molecule of claim 1 3, wherein the target peptide moiety is Ml 3. 

1 5. The nucleic acid molecule of claim 1 1 , further comprising an amino acid linker in 
frame with said first chemical entity binding region and fusing said protein 
association region with said first chemical entity binding region. 

16. The nucleic acid of claim 15, wherein said fluorescent protein energy transfer 
partner further comprises a localization sequence. 

17. A nucleic acid molecule, comprising a polynucleotide that encodes: 

1 ) a first fluorescent protein energy transfer partner, comprising: 

a chemical entity binding region that binds to a chemical entity and binds to 
a second fluorescent protein energy transfer partner, and 

a first fluorescent protein in fianie with said first chemical entity bindmg 

region; and 

2) said second fluorescent protein energy transfer partner, comprising: 

a protein association region that binds to said chemical entity binding region, 

and 

a second fluorescent protein in frame with said protein association region; 

wherein said first fluorescent protein and said second fluorescent protein are 
energy transfer partners and said first fluorescent protein energy transfer partner is in 
fi'ame with said second first fluorescent protein energy transfer partner. 



1 8. The nucleic acid molecule of claim 1 7, wherein said fu:st and fluorescent protein 
are ^e^worcflf-related fluorescent proteins. 
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19. The nucleic acid molecule of claim 17, wherein said chemical entity binding 
region comprises a bound conformation of smaller volume when said chemical 
entity is bound to said chemical entity bindmg region compared to said chemical 
entity binding region when said chemical entity is not bound to said chemical 
entity binding region and said bound conformation permits increased energy 
transfer between said fust fluorescent protein energy transfer partner and said 
second fluorescent protein energy transfer partner. 

20. The nucleic acid molecule of claim 1 9, wherein said bound conformation increases 
the afifmity between said chemical entity binding region aiid said second 
fluorescent protein energy transfer partner. 

21 . The nucleic acid molecule of claim 20, wherein said chemical entity is an analyte. 

22. A system for monitoring protein-protein association, comprising: 

1) a first fluorescent protein energy transfer partner, comprising: 

a chemical entity binding region that bmds to a chemical entity and bmds to 
a second fluorescent protein energy transfer partner, and 
a first energy transfer partner; and 

2) said second fluorescent protein energy transfer partner, comprising: 

a protein association region that binds to said chemical entity binding region, 

and 

a second energy transfer partner; 

wherein said first energy transfer partner and said energy transfer partner are 
energy transfer partners. 



23. An expression vector containing the nucleic acid sequence of claim 1 . 



24. A transgenic non-human animal comprising a nucleic acid sequence according to 
claim 1. 
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25. An expression vector comprising e^q^ression control sequences operativeiy linked 
to a nucleic acid sequence coding for the expression of a fluorescent indicator, the 
indicator comprising: 

a binding protein moiety having an analyte-bindmg region which binds an 
analyte and causes the indicator to change conformation upon exposure 
to the analyte; 

a donor fluorescent protem moiety fused to the binding protein moiety; and 
an acceptor fluorescent protein moiety fused to the binding protein moiety, 
wherein the donor moiety and the acceptor moiety change position relative 
to each other when the analyte binds to the analyte-binding region, altering 
fluorescence resonance energy transfer between the donor moiety and the 
acceptor moiety when the donor moiety is excited. 

26. The expression vector of claim 14 adapted for fimction in a piokaiyotic cell. 

27. The expression vector of claun 14 adapted for function in a eukaiyotic cell. 

28. A host cell transfected with an expression vector comprising an expression control 
sequence operativeiy linked to a sequence coding for the expression of a fluorescent 
indicator, the indicator comprising: 

1 . a binding protein moiety having an analyte-binding region which 
binds an analyte and causes the indicator to change conformation 
upon exposure to the analyte; 

2. a donor fluorescent protein moiety covalently coupled to the bmding 
protein moiety; and 

3. an acceptor fluorescent protein moiety covalently coupled to the 
binding protein moiety, 

wherein the donor moiety and the acceptor moiety change position relative to 
each other when the analyte binds to the analyte-binding region, altering 
fluorescence resonance eneiigy transfer between the donor moiety and the 
acceptor moiety when the donor moiety is excited. 
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29. 
30. 
31. 
32. 
33. 
34. 

35. 
36. 



The cell of claim 1 7, wherein the cell is a prokaiyote. 

The cell of claim 1 8, wherein the cell is E. coll 

The cell of claim 1 7, wherein the cell is a eukaiyotic cell. 

The cell of claim 20, wherein the cell is a yeast cell. 

The cell of claim 20, wherein the cell is a mammalian cell. 

A transgenic non-himian animal having a phenotype characterized by expression 
of the niicleic acid sequence of claim 1, the phenotype being conferred by a 
transgene contained in the somatic and genn cells of the mouse, the transgene 
comprising a nucleic acid sequence which encodes a fluorecent indicator specific 
antigen polypeptide. 

The transgenic non-human animal of claim 23, wherein the animal is a mouse. 

A method for producing a transgenic non-human animal having a phenotype 
characterized by expression of the nucleic acid sequence of claun 1, the method 
comprising; 

(a) mtroducing a transgene into a zygote of an animal, the transgene 
comprising a DNA construct encoding a the fluorescent indicator 
specific antigen; 

(b) transplanting the zygote into a pseudopregnant animal; 

(c) allowing the zygote to develop to term; and 

(d) identifying at least one transgenic offspring containing the 
transgene. 
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37 The method of claim 25, wherein the introducing of the transgene mto the 
embiyo is by introducing an embiyonic stem cell containing the transgene into 
the embiyo. 

38. The method of claim 25, wherein the introducmg of the transgene into the 
embiyo is by infecting the embiyo with a retrovirus containing the transgene. 

39. The method of claim 25, wherein the animal is a mouse. 



41 . The nucleic acid of claim 40, wherein the donor fluorescent piotein moiety and 
the acceptor fluorescent protem moiety are Aequorea-rclsited fluorescent protein 
moieties. 

42. The nucleic acid of claim 4 1 , wherein the indicator further includes a target 
peptide moiety and a linker moiety that covalently couples the binding protein 
and the target peptide moiety and is a peptide moiety, and the binding protein 
moiety further includes a peptide-binding region for binding a target peptide 
moiety. 



40. An isolated nucleic acid sequence which encodes a fluorescent indicator, the • 
indicator comprising: I 
a binding protein moiety having an analyte-binding region which binds an i 
analyte and causes the indicator to change conformation upon exposure to the i 
analyte; 

a donor fluorescent protein moiety covalently coupled to the binding 
protein moiety; and 

an acceptor fluorescent protein moiety covalently coupled to the binding protein 
moiety, 

wherein the donor moiety and the acceptor moiety change position relative 
to each other when the analyte binds to the analyte-binding region, altering 

fluorescence resonance energy transfer between the donor moiety and the | 
acceptor moiety when the donor nioiety is excited. 



.1 

3 
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43. The nucleic acid of claim 42, wherein the donor fluorescent protein moiety is 
fused to the bmding protein moiety and the acceptor fluorescent protein moiety is 
fused to the target peptide moiety. 

44. The nucleic acid of claim 41, wherein the binding protein moiety is calmodulm, a 
calmodulin-related protein moiety, cGMP-dependent protein kinase, a steroid 
hormone receptor, a lig3nd binding domain of a steroid hormone receptor, protein 
kinase C, inositoH,4,5-triphosphate receptor, or recoverin. 

45. The nucleic acid of claim 42, wherein the binding protein moiety is calmodulin or 
a calmodulin-related protein moiety. 

46. The nucleic acid of claim 45, wherein the donor fluorescent protein moiety is P4- 
3, EBFP, or WIB, and the acceptor fluorescent protein moiety is S65T, EGFP, or 
10c. 



49. The nucleic acid of claim 48, wherein the linker moiety is -Gly-Gly-. 

50. The nucleic acid of claim 40, wherein the indicator further comprises a 
localization sequence. 



47. The nucleic acid of claim 45, wherein the target peptide moiety is a cabnodulin- 
binding domain of skMLCKp, smMLCK, CaMKII, Caldesmon, Calspermin, p- 

i 

hosphofhictokinase calcineurin, phosphoiylase kinase, Ca2+-ATPase 59 kDa 

PDE, 60 kDa PDE, nitric oxide synthase, type I adenylyl cylclase, Bordetella \ 
pertussis adenylyl cyclase, Neuromodulin, Spectrin, MARCKS, F52, B-Adducm, 1 
HSP90a, HIV-1 gpl60, BBMHBI, Dilute MHC. Mastoparan, Melittin, Glucagon, \ 
Secretin, VIP, GIP, or Model Peptide CBP2. I 

48. Thenucleicacidofclaim47, wherein the target peptide moiety is Ml 3. 
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5L An expression vector containing the nucleic acid sequence of claim 40. 

52. A transgenic non-human animal comprising a nucleic acid sequence according to 
claim 40. 



53. An expression vector comprising expression control sequences operatively linked 
to a nucleic acid sequence coding for the expression of a fluorescent indicator, 
the indicator comprising: 

a binding protein moiety having an analyte-binding region which binds an 
analyte and causes the indicator to change conformation upon exposure to the 
analyte; 

a donor fluorescent protein moiety fused to the binding protein moiety; 

and 

an acceptor fliiorescent protein moiety fiised to the binding protein moiety, 

wherein the donor moiety and the acceptor moiety change position relative 
to each other when the analyte binds to the analyte-binding region, altering 
fluorescence resonance energy transfer between the donor moiety and the 
acceptor moiety when the donor moiety is excited. 

54. The expression vector of claim 53 adapted for function in a prokaryotic cell. 

55. The expression vector of claim 53 adapted for function in a eukaryotic cell. 
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56. A host cell transfected with an e}q>ression vector comprising an expression 
control sequence operatively linked to a sequence coding for the expression of a 
fluorescent indicator, the indicator comprising: 

a binding protein moiety having an analyte-binding region which binds an 
analyte and causes the indicator to change conformation upon exposure to the 
analyte; 

a donor fluorescent protein moiety covalently coupled to the binding protein 
moiety; and 

an acceptor fluorescent protein moiety covalently coupled to the binding 
protein moiety, 

wherein the donor moiety and the acceptor moiety change position relative to 
each other when the analyte binds to the analyte-binding region, altering fluorescence 
resonance energy transfer between the donor moiety and the acceptor moiety when 
the donor moiety is excited. 

57. The cell of claim 56, wherein the cell is a prokaiyote. 

58. The cell of claim 57, wherein the cell is £1 colL 

59. The cell of claim 56, wherein the cell is a eukaryotic cell. 

60. The cell of claim 59, wherein the cell is a yeast cell. 

61. The cell of claim 59, wherein the cell is a mammalian cell. 

62. A transgenic non-human animal having a phenotype characterized by expression 
of the nucleic acid sequence of claim 40, the phenotype being conferred by a 
tmnsgene contained in the somatic and germ cells of the mouse, the transgene 
comprising a nucleic acid sequence which encodes a fluorecent indicator specific 
antigen polypeptide. 
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63. The transgenic non-human animal of claim 62, wherein the animal is a mouse. 

64. A method for producing a transgenic non-human animal having a phenotype 
characterized by expression of the nucleic acid sequence of claim 40, the method 
comprising: 

(a) introducing a transgene into a :qrgote of an animal, the transgene 
comprising a DNA construct encoding a the fluorescent indicator specific 
antigen; 

(b) transplanting the zygote into a pseudopregnant animal; 

(c) allowing the zygote to develop to term; and 

ft 

(d) identifying at least one transgenic offispring containing the transgene. 

65. Hie method of claim 64, wherein the introducing of the transgene into the 
embiyo is by introducing an embiyonic stem cell containing the transgene mto 
the embiyo. 

66. The method of claim 64, wherein the introducing of the transgene into the 
embryo is by infecting the embiyo with a retrovirus containing the transgene. 



67. The method of claim 64, wherein the animal is a mouse. 
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68. A fluorescent indicator comprising: 

a binding protein moiety having ah anaiyte-binding region which binds an 
analyte and causes the indicator to change conformation upon exposure to the 
analyte; 

a donor fluorescent protein moiety covalently coupled to the binding protein 
moiety; and 

an acceptor fluorescent protein moiety covalently coupled to the binding 
protein moiety, 

wherein the donor moiety and the acceptor moiety change position relative 
to each other when the analyte binds to the anaiyte-binding region, altering 
fluorescence resonance energy transfer between the donor moiety and the 
acceptor moiety when the donor moiety is excited. 

69. The indicator of claim 68, wherein the donor fluorescent protein moiety and the 
acceptor fluorescent protein moiety are Aequorea-relat&d fluorescent protein 
moieties. 



70. The indicator of claim 69, wherein the indicator further includes a target peptide 
moiety and a linker moiety that covalently couples the binding protein and the 
target peptide moiety, and the bindmg protein moiety further includes a peptide- 
binding region for binding the target peptide moiety. 

71 . The indicator of claim 70, wherein the indicator further comprises a localization 
sequence. 

72. The indicator of claim 71, wherein the localization sequence is a nuclear 
localization sequence, an endoplasmic reticulum localization sequence, a 
peroxisome localization sequence, a mitochondrial localization sequence, or a 
localized protein. 
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TS. The indicator of claim 70, wherein the donor fluorescent protein moiety is 
covalently coupled to the binding protein moiety and the acceptor fluorescent 
protein moiety is covalently coupled to the target peptide moiety. 

74. The indicator of claim 70, wherein the linker moiety is a peptide moiety. 

75. The indicator of claim 74, wherein the linker moiety includes between about 1 
amino acid residue and about 30 amino acid residues. 



76. The indicator of claim 74, wherein the mdicator is a single polypeptide. 

77. The indicator of claim 76, wherein one of the donor fluorescent protein moiety or 
the acceptor fluorescent protein moiety is covalently coupled to the carboxy 
terminus of the single polypeptide and the other of the donor fluorescent piotem 
moiety or the acceptor fluorescent protein moiety is covalently coupled to the 
amino terminus of the single polypeptide. 

78. The indicator of claim 69, wherein the binding protein moiety is cahnodulin, a 
calmodulin-related protein moiety, cGMP-dependent protein kinase, a steroid 
hormone receptor, a ligand binding domain of a steroid hormone receptor, protein 
kinase C, inositol- 1,4,5-triphosphate receptor, or recoverin. 

79. The indicator of claim 70, wherein the binding protein moiety is calmodulin or a 
calmodulin-related protein moiety. 

80. The indicator of claim 79, wherein the donor fluorescent protein moiety is P4-3, 
EBFP, or WIB, and the acceptor fluorescent protein moiety is S65T, EGFP, or 
10c. 
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8 1 . The indicator of claim 79, wherein the target peptide riioiety is a cahnodulin- 
binding domain of skMLCKp, smMLCK, CaMKII, Caldesmon, Calspermin, p- 
hosphofructokinase calcineurin, phosphorylase kinase, Ca2+-ATPase 59 kDa 
PDE, 60 kDa PDE, nitric oxide synthase, type I adenylyl cylclase, Bordetella 
pertussis adenylyl cyclase, Neuromodulin, Spectrin, MARCKS, F52, Jl-Adducin, 
HSP90a, HIV-1 gpl60, BBMHBI, Dilute MHC, Mastoparan, Melittin, Glucagon, 
Secretin, VIP, GIP, or Model Peptide CBP2. 

82. The indicator of claim 81, wherein the taiget peptide moiety is the cahnodulin- 
binding domain of skMLCK. 

83. The indicator of claim 82, wherein the linker moiety is -Gly-Gly-. 



84. A fluorescent indicator comprising: 
a target peptide moiety; 

a binding protein moiety having an analyte-binding region which binds an 
analyte and causes the indicator to change conformation upon exposure to the 
analyte, and a peptide-binding region for bmding the target peptide moiety; 

a linker moiety that covalently couples the binding protein and the target 
peptide moiety and is a peptide moiety; 

a donor fluorescent protein moiety covalently coupled to the binding 
protein moiety; and 

an acceptor fluorescent protein moiety covalently coupled to the target peptide 
moiety, 

wherein the donor moiety and the acceptor moiety change position relative 
to each other when the analyte binds to the analyte-binding region, altering 
fluorescence resonance energy transfer between the donor moiety and the 
acceptor moiety when the donor moiety is excited, and the indicator is a single 
polypeptide. 
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85. The indicator of claim 84, wherein the donor fluorescent protein moiety and the 
acceptor fluorescent protein moiety are ^e^iiorea-related fluorescent protein 
moieties. 



86. The indicator of claim 85, wherein the donor fluorescent protein moiety is P4-3, 
EBFP, or WIB, and the acceptor fluorescent protein moiety is S65T, EGFP, or 
10c. 



87. The indicator of claim 86, wherem the binding protein moiety is calmodulin or a 
calmodulin-related protein moiety. 

88. The indicator of claim 87, wherein the target peptide moiety is M13. 

89. The indicator of claim 88, wherein one of the donor fluorescent protein moiety or 
the acceptor fluorescent protein moiety is located at the caiboxy terminus of the 
single polypeptide and the other of the donor fluorescent protein moiety or the 
acceptor fluorescent protein moiety is located at the amino termmus of the single 
polypeptide. 

90. The indicator of claim 84, wherein the indicator further comprises a nuclear 
localization signal, an endoplasmic reticuliun localization signal, a peroxisome 
localization signal, a mitochondrial localization signal, or a localized protein. 



.getthepatentxom/Login.do| |^fink9 9/Fetch/W0984Q477 c^ li^jpi ir hrilli iiii] 



WO 98/40477 PCr/US98/04978 



- 84 - 

91. A method for detennining the concentration of an analyte in a sample 
comprising: 

contacting the sample with a fluorescent indicator comprising a binding 
protein moiety having an analyte-binding region which binds an analyte and 
causes the indicator to change conformation upon exposure to the analyte, a 
donor fluorescent protein moiety covalenlly coupled to the binding protein 
moiety, and an acceptor fluorescent protein moiety covalently coupled to the 
binding protein moiety, wherein the donor moiety and the acceptor moiety 
change position relative to each other when the analyte binds to the analyte- 
binding region, altering fluorescence resonance energy transfer between the 
donor moiety and the acceptor moiety when the donor moiety is excited; 
exciting the donor moiety; and 

determming the degree of fluorescence resonance energy transfer in the 
sample corresponding to the concentration of the analyte in the sample. 



92. The method of claim 91, wherein the step of determining the degree of 
fluorescence resonance energy transfer in the sample comprises measuring light 
emitted the acceptor fluorescent protein moiety. 

93. The method of claim 91, wherein determining the degree of fluoiescence 
resonance energy transfer in the sample comprises measuring light emitted from 
the donor fluorescent protein moiety, measuring light emitted fix)m the acceptor 
fluorescent protein moiety, and calculating a ratio of the light emitted from the 
donor fluorescent protein moiety and the light emitted from the acceptor 
fluorescent protein moiety. 

94. The method of claim 91 , wherein the step of determining the degree of 
fluorescence resonance energy transfer in the sample comprises measuring the 
excited state lifetime of the donor moiety. 
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95. The method of claim 93, further comprising the steps of determining the 
concentration of the analyte at a first time after contacting the sample with the 
fluorescence indicator, determining the concentration of the analyte at a second 
time after contacting the sample with the fluorescence indicator, and calculating 
the difference in the concentration of the analyte at the first time and the second 
time, whereby the difference in the concentration of the analyte m the sample 
reflects a change in concentration of the analyte present in the sample. 

96. The method of claim 95, fiirther comprising the step of contacting the sample 
with a compound between the first time and the second time, whereby a 
difference in the concentration of the analyte in the sample between the first time 
and the second time indicates that the compound alters the presence of the 
analyte. 

97. The method of claim 91, wherein the sample comprises an intact cell and the 
contacting step comprises incorporating the fluorescent indicator into the cell. 

98. The method of claim 97, wherein step of incorporating the fluorescent indicator 
into the cell includes transfecting the cell with an expression vector comprising 
expression control sequences operably linked to a nucleic acid sequence coding 
for the expression of the fluorescent indicator. 

99. The method of claim 91, wherein the analyte is calcium. 

1 00. The method of claim 9 1 , wherein the donor fluorescent protein moiety and 
the acceptor fluorescent protein moiety are Aequorea-xelated fluorescent 
protein moieties. 



4 

3 
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101. The method of claim 100, wherein the indicator further includes a target 
peptide moiety and a linker moiety that covalently couples the binding 
protein and the target peptide moiety and is a peptide moiety, and the 
binding protein moiety further includes a peptide-binding region for 
binding a target peptide moiety. 

1 02. The method of claim 101, wherein the indicator is a single polypeptide. 

1 03 . The method of claun 9 1 , wherein the binding protein moiety is cabnodulin 
or a calmodulin-related protein moiety, the donor fluorescent protein 
moiety is P4-3, EBFP. or WIB, the acceptor fluorescent protein moiety is 
S65T, EGFP, or 10c, and the target peptide moiety is M13. 

104. The method of claim 103, wherein the indicator further comprises a 
localization sequence. 
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FIG. 7 ^ 

(SEQ ID. N0:1) 

ATGCrra^GCA.i^CXK3CGAGGAGCTGTTaiCCGGGGTGGTGCCCATCCTGGTCGAGCT(K5.^CGGC^ 
CGTAA.!lCC-GCaiCAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCa!^TGCCACCTACGGC^^^ 

CCCTGAAGTTCATCTGCACOVCCGGCAAGCTXSCCCGTXKTCCTGGCCCaccCTCGTGACCaCCCT^ 
ACCC\TGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATa^.AC-CAGCACGACTrC^ 

CGCCATGCCCGAAGGCTACGTCC^GGAGCGCa.CCATCTTCTTCAAGGACGACGGCAACTAC^^^ 
CCCGCGCCG.aGGTGAAGTTCGAQGC^rGACACCCTC^-Ta'^^CCGCa^^^^ 

TTCAAGGAGGACGGCAACATCCTGGGGCAau^GCTGGAGTACAACTTaUicaGCCAC^ 
T.ATCaTGGCCGACA.iVGCAGAAGAACGGa^TCAAC-GTa^ACTTCaji^ATCCGCCA 

acgc-cagcgtc<:agctcgccgaccactaccagcagaacacccccatcggcgacggccccgtgctg 

CTGCCCXi:^CaACCACTACCTGAGCACCCAGTCCGCCCnX3AGCAAAG.\CCCCAACaa^^^^ 
TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCCGCATGCATGACCAACrGACAGAAa^^^^^ 
TTGCAGAGTTCyU^GAAGCCTTCTCOTATTCGACAAGGATGGGGACC^ 
GAACTTGC<y^CCGTTATaAGGTC6CTTGGACA.iUU^CCCAACGGAAGCAa^ 

GAAAAATa^aGGACiVaiaACAGCGAAaAGGA.aATCCGAGAAGC\TTCC^^ 
GGGAACGr<TACATCAGCC<rrGCTG.:^AT7ACG7CACGTa^.TGAC:A.i_^^^^ 

AaaTG.iUVGAJVGTTGATGAA.ATaiVTAAGGGAAGCAGATATCGATGGTa\TGGCCAAGT^^^^^^ 
AAGAGTTTGT.ACAAATG.ATa^CAC<:.3L'LiiGGGGGGG.3UiGAGGCGCTGG^^ 

GTCAGCGCTGCCaACCGGTTCAAGAAGATCTCCGAGCTCAT--GTGAGCA.AGGGC6AGGAGCTGTT 

CACCGGC^:-TGGTGCCCATCCTC-GTCGA6CTGGACC-GCGACGT.ajUiCGGCCACAAGTTCAG^^ 

CCC-GCGAGGGCGAGC-GCGATC<CACCTACC<;CAAGCTGACCCTG.ajlGTTCATCTGCAC<:ACC 

iiAGCTGCCCGTGCCCTGGCCCACCCTCGTGACaVCCCTGACCTACaSCGTGCAGTGCTTCAGCCG 

CTACCCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGG 

AGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGGC 

GACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGG 

GCACAAGCTGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGfiACG 

GCATCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGGAGCGTGCAGCTCGCCGftCCAC 

TACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCAC 

CCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGai^GTTCGTGA 
CCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGTAA 
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FIG. 7 (continued) 

(SEQ ID. N0:2) 

MVSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKLPVPWPTLVTTL 

TKGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGDTLVNRIELKGID 
FKEDGNII^-KKLEYNF^ISKNVYIMADKQKNGIKV^^ 

LPDNOTLSTQSALSXDPNEKSDHMVLI^FVTAARMEDQLTEEQIAEFKEAFSLFDKDGDGTITTK 
ELGTVMRSI,GQNPTEAELQDMINEVDADGNGTIYFPEFLT^miUCMKDTDSE32IR^ 

GNGYISAA2LR.WMTNIX;EKLTDEEVDEMIREADIDGDGQVNYEEFVQMMTAKG^ 

VSAANRFXKISELMVSKGEELFTGVVPILVBLDGDVNGHKFSVSGEGEGDATYGXLTLKFICTTG 

klpvpwptlvttltygvqcfsrypdhmkqkdffksampegyvqertiffkddgnyktraevkfeg 

DTLVlmIEL.KGIDFKEDGNILGKKLEYNYNSK^VYI^IADKQKNGIKVNFKIRENI2DGSVQL^ 
YQQOTPIGDGPVIiPDNHYLSTQSALSKDPNEKSDEMVLLEFVTAAGITLG>IDELrK* 
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FIG. a 

(SEQ ID. NO: 3) 

ATGGTGAGCA.i^GGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTC-GTCGAGCTGGACGGCGA 
CGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGA 
CCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTG.\CCACCCTG 
ACCCATC^KZGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGaiGCACGACTTCTTCAAGTC 
CGCCATGCCCGAAGGCTACGTCC\QGAGCGCACCATCTTCTTCAAGGACG.aCGGC:AACTACajW^ 

CCCGCGCCa3VGGTGAAGTTCGAGGGCG.aCACCCTC-GTGAACCGCATCGAGCTGAAGGGCATCGAC 

TTCAAGGAGa\CGGCAACATCCTGGGGCACAAGCTC<5AGTACA.^CTTCAiiCAGCCAC^ 

TATCATGGCCGAOUlGCAGAAGAACGGCATCAAGGTGAACTTCAaXLWCCGCCACa^^ 

ACGGCAGCGTGa^GCTCGCCGACCACTACCAGCAG.aACACCCCa^TCGGCGACGGCCCCGTC-CTG 

CTGCCCGACAACCACTACCTGAGCACCCAGTCCGCCCTGAGCAAAG.!^CCCCAACGAGAAGCGCGA 

TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCCGCATGCATGACCaACTGACAG.aAGAGaVGA 

TTGCAGAGTTCAAAGAAGCCTTCTCATTATTCGACAAGGATGGGGACGGC^CCATCACCACAAAG 

GAACTTGGCACCGTTATGAGGTCGCTTGGACAAAACCCRACGGA\GCAa!UlTTGC:AGG.2lTATGAT 

CAATGAAGTCGATGCTGATGGCAATGG.AACGATTTACTTTCCTGAATTTCTTACT.3VTGATGGCT^^ 
G.2UUULA.TGA!«KaCACAGAC^GCGAAG^^ 

GGG.iJiCGGCTAaVTCiGCGCTGCTaaATTACGTCACGTCATGACA.aj^X^ 

Aa\TGA^GaAGTTGATGJVAATa\TAAGGQAAGCAGATATCG.^TGGTGATGGCCAAGTA^ 

A.5i^GTTTGTACAAATGATGACAGCAAi^GGGGGGGAAGAQGCGCTGG?LAGAAAAACrTCATTGCC 

GTC3VGCGCTGCaLa.CCGGTTCAAGAAGATCTCCGAGCTCATGGTGAGCAAGGGCGAGGAGC^^ 

CACCGGGGTGGTGCCC\TCCTGGTCGAGCTGGACGQCa\CGTAA\CC-GCCACAAGT^ 

CCGGCa^VGCGCGAGGGCGATGCCACCTACGGOUVGCTGACCCTGaAGXTCATCTGCACCACCGGC 

AAGCTGCCCGTGCCCTGGCdCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTa^GCCG 

CTACCCCQACCACATGAAGaVGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGG 

AGCGCACaVTCTTCTTCAAGGACGACGGaVACrACAA(aCCCGCGCCGAG6T6AAGTTCG^ 

GACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGG 

GCACAAGCKKSAGTACAACTACAACAGCCACRACGTCTATATCATGGCCGAa^GaVGA^ 

GCATCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGC^GCrCMCCGSCC^^ 

TACCAGCAGAACACCCCC^TCGGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCAC 

CCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGA 

CCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGCCAAAAAAGAAGAGA.AAGGTGGAA 
GACGCTTAA 
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FIG. 8 (continued) 

(SEQ ID. N0:4) 

^fVSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKLPV?WPTLVTTL 

THGVQCFSRYPD^mQmPFKSAMPEGYVQERTIFFXDDGNYKTRAEVKFEGDTLVNRIELK^ 

FKEDGNILGHKLEYNFNSHNVYIMADKQKNGIKVNFKIRKNIEDGSVQIJ^ 

LPDNHYLSTQSALSKDPNEKRDKMVLLEFWAAIMHDQLTEEQIAEFKSAFSLF^ 

ELGTV>IRSLGQNPTEAELQDMINEVDADGNGTIYFPEFLTMMARKMKDro^ 

GNGY I S AAELRKVMTNLGSKLTDEE VDEMI READ I DGDGQVNYEEFVQMMTAKGGKRRWKKNF I A 

vsaanrf:c<iselmvskgeelftgvvpilveldgdvnghkfsvsgegegdatygklt^ 

KLPVPWPTLVTTLTYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEG 

DTLVNRIELKGIDFKEDGNILGHKLEYNYNSHNVYIMADKQKNGIKVNFKIRHNIEDGSVQI^ 

YMOTPIGDGPVLLPDNKYLSTQSALSKDPNEKRDKMVLLEFVTA^^^ 

DA* 
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FIG. 9 



(SEQ ID. N0:5) 

ATC<5TGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGC^ 
CGTAAACGGCGACAAGTTOVGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCT^^^ 

CCCTGAAGTTC^tCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCC^CCCTCGTGACCAC^ 

ACCCATGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATaAAGCAGC^^ 

CGCC^TGCCCGAAC-GCTACGTCC^GaJVGCGCACCATCTTCTTCA^ 

CCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGOITCGAGCTGAAGGGCATCGAC 
TTC^-GAGaACGGCAACATCCTGGGGaiCAA.GCTGGAGTAOAOT 

TATC2^TGGCCa\CAAGCAGAAGAACGGa^TCAAGGTGAACTTCAAGATCC^ 

ACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAaAACACCCCCATCGGCGACGG^ 

CTGCCCGACAACCACTACCTGAGGACCCAGTCCGCCCTGAGCAA^aACCCa^^^^ 

TCA.CATGGTCCTGCTGGAGTTCGTGACCGCCGCCCGC\TGCATGACC^ 

TTGC^GAGTTCAAAGAAGCCTTCTCATTATTCGACAAGGATGGGGACGG^ 

GAACTTGGa^CCGTTATGAGGTCGCTTGGACAAAACCCAACGa^2i.GaVGA^^ 

CAATGAAGTCGATGCTGATGGCAATGGAACGATTTACTTTCCTGAATTTCTTACT^^ 

GGGAACGGCTAC^TCAGCGCTGCTCAGTTACGTCACGTCATC^^CAAAC 
Aa\TGAAGAAGTTGATGAAATGATA\GGGAAGC\GAT^^ 

AAGAGTTTGTAC\?UTGATGACAGCAAAGGGGGGG2U^GAGGCGCTGGAAGAAAAACT^ 
GTCAGCGCTGCCAACCGGTTCAAGAAGATCTCCGAGCTCATGGTa^GOLAGGGCGAC-^^ 
CACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCC^CAAGTTCAGCGTGT 
CCGGCGAGGGCGAGGGCa^TGCOVCCTACGGOUlGCTGACCCrGAAGTTCATCTGC^^^ 

AAGCTGCCCGTGCCCTC-GCCCACCCTCGTGACCACCCTGACCTACC-GCGTGCAGTGCTTCAGCCG 
CTACCCCGACCACATGAAGC2^GC\CGACTTC!TCAAGTC^ 

AGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAG^ 

GACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGGACG<^^ 

GCACAAGCTGGAGTACAACTACAACAGCCACAACGTCTATATCAT^^ 

GCATCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCG^ 
TACCAGC\GAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTG^^ 
CCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGA 
CCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGTAA 
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FIG. 9 (continued) 

(SEQ ID. NO: 6) 

MVSKGEELFTGWPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKLPVPWPTLVTTL 

THGVQCFSRYPDHMKQHDFFXSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGDTLVNRIELKGID 
FKEDGNILGHaEYNFNSHNVYIMADKQKNGIKVNFKIRHNIEDGSVQLADHYQQNTPI 

LPDNHYLSTQSALSKDPNEKRDHMVLLEFVTAAR^IHDQLTEEQIAEFKEAFSLFDB1)GD6TITC 

ELGTVMRSIXSQOTTEASLQDMINEVDADGNGTIYFPEFLTMMARKMKDTDSEEEIREAFRV^^ 

GNGYISMQ^JU^V^mILGEKLTDEBVDEMIREADIDGD<kJVOTEEFVQ^^^ 

VSAANRFmSSLMVSKGEELFTGWPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTG 
KLPVPWPTLVTTLTYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGiJYKTRASVKFEG 
DTLVNRIELKGIDFKEDGNIIXSHXLEYNYNSHNVYIMADKQIQIGIKVNFKIRKtllETCSV 
YQQNTPIGDGPVUiPDNHYLSTQSALSIODPNEKjaJHMVUJEFVTJUVGITIXSMDELYK* 
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FIG. 10 

(SEQ ID. NO: 7) 

ATGCTGCTGCCCGTCCCCCTGCTGCTGGGCCTGCTGGGCGCCGCCGCCGACGTGAGa^A 
GGAGCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCAOVAGT 

tca.gcgtgtccggcgagggcgagggcgatgcc::acctacggcaagctgaccctgaagttcatctgc 
acc\ccggcaagctgcccgtgccctggcccaccctcgtgaccaccctgaccc\tg^ 

cttcagccgctaccccgaccacatgaagcagcacgacttcttcaagtccgccatgcccgaaggct 

ACGTCa^GGAGCGC^CCATCrTCTTCAAGaaca^^CGGCAACTAC^^^^ 

TTCGAGGGCGACACCCTGGTGA\CCGCATCGAGCTGAAGGGC\TCa^-CTTCAAGGAGGACGGC^ 

CATCCTGGGGCACiL'lGCTGGAGTACAACTTCAACAGCCACAACGTCTATATC^ 

AGA\GAACGGCATCAA.GGTGAA.CTTCA\GATCCGCCACAACATCa\GGACGGC^ 

GCCGACCACTACC^GCAGAACACCCCa^TCGGCiGACGGCCCCGTGCTGCTGCCCGACaACCACTA 

CCTGAGC\CCaVGTCCGCCCTa2VGCA]^GACCCCaACGAGAAGCGCGATCACATGGTCCTGCTC 

AGTTCGTa^CCGCCGCCCGCATGCATGACCA^^CTGACAGAAGAGCAGATTGCAaAGTTC^^^ 

GCCTTCTO^TTATTCGACAAGGATGGGGACGGCACCATCACCACAJL^GGAACTTGGCA 

GAGGTCGCTTGGACAAAACCCAACGGAAGC\GAATTGCAGGATATGATCAATGA\GTCGATG 

ATGGCJL\TGGA^XGATTTACTTTCCTGA\TTTCTTACTATGATGGCTAGA?^^ 

GAC^GCGA^GAGGAAATCCGAGA^GCATTCCGTGTTTTTGACAAGGATGGGAACGGCTACATC^^ 

CGCTGCTC2lGTTACGTCACGTCATGAC^JIACCTCGGGGAGAAGTTA^CAGATGA\^^ 

AAATa^lTAAGGGAAGCAa^TATCGATGGTGATGGCCA^GTAAAC^^ 

ATGACAGaUUlGGGGGGGAAGAGGCGCTGGA\GA?Ul2UV(nTCA^ 

GTTCAAGAAGATCTCCGAGCTCATGGTGAGaL^lGGGCGAGGAGCTGTTC\CCGGGG 

TCCTGGTCGAGCTGGACGGCGACGTA^L^CGGCCACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGC 

GATGCC\CCTACGGCAAGCTGACCCTGAAGTTCATCTGC\CC\CCGGC^AGCTGCCCGTGCCCTG 

GCCaVCCCTCGTGACC^CCCTGACCrACGGCGTGCAGTGCTTaVGCCGCTACCCCaA.CCACATGA 

AGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCO^^ 

AAGGACGACGGOUICTACAAGACCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGA^ 

CATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGC^^ 

ACTAOUVCAGCCACAACGTCTATATCATGGCaSACAAGCAGAAGAACGG^^ 

AAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCC 

CATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGC\CCCAGTCCGCCCTGAGCA 

AAGACCCCAACQAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACT 
CTCGGCAAGGACGAGCTGTAA 
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FIG. 10 (continued) ^ 

(SEQ ID. N0:8) 

MLLPVPLLLGLLGAAADVSKGEELFTGWPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFIC 

TTGKLPVPWPTLVTTLTHGVQCFSRYPDH^IKQHDFFKSAMPEGYVQERTIFFKDDG^TYCT 

FEGDTLVNRIELKGIDFKEDGNIIiGHKLEYNFNSHNVyIMADKQKNGIKV^^^ 

ADHYQQNTPIGDGPVLLPDNHYLSTQSALSKDPNEKRDHMVLLEFVTAA 

AFSLFDKZDGDGTITTKELGTVMRSLGQNPTEAELQDMINEVDADGNGTIYFPEFL^^ 

DSEEEIREAFRVFDiQGNGYISAAQLRHVMTNLGEiaiTbEEVDEMIRE^ 

MTAKGGKRRWKKNFIAVSAANRFKiaSELMVSKGEELFTGWPILVELDGDV^^ 

DATYGKLTLKFICTTGKLPVPWPTLVTTLTYGVQCFSRYPDHMKQKDFFKS^^ 

KDIX3NYKTRAEVXFEGDTLVNRIELKGIDFKEDGNILGHKLEYNY>TS^^^ 

KIRHNIEDGSVQLADHYQQNTPIGDGPVLLPDNHYLSTQSALSKDPNEKSDH^ 
LGKDEL* 
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